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In this paper and a companion paper, we show how the framework of information geometry, a 
geometry of discrete probability distributions, can form the basis of a derivation of the quantum 
formalism. The derivation rests upon a few elementary features of quantum phenomena, such 
as the statistical nature of measurements, complementarity, and global gauge invariance. It is 
shown that these features can be traced to experimental observations characteristic of quantum 
phenomena and to general theoretical principles, and thus can reasonably be taken as a starting point 
of the derivation. When appropriately formulated within an information geometric framework, these 
features lead to (i) the abstract quantum formalism for finite- dimensional quantum systems, (ii) the 
result of Wigner's theorem, and (iii) the fundamental correspondence rules of quantum theory, 
such as the canonical commutation relationships. The formalism also comes naturally equipped 
with a metric (and associated measure) over the space of pure states which is unitarily- and anti- 
unitarily invariant. The derivation suggests that the information geometric framework is directly or 
indirectly responsible for many of the central structural features of the quantum formalism, such as 
the importance of square-roots of probability and the occurrence of sinusoidal functions of phases 
in a pure quantum state. Global gauge invariance is seen to play a crucial role in the emergence of 
the formalism in its complex form. 



I. INTRODUCTION 

The formalism of quantum theory has many mathe- 
matical features, such as its use of complex vector spaces, 
whose physical origin is obscure. Over the last two 
decades, there has been growing interest in the eluci- 
dation of the origin of these features by attempting to 
derive the quantum formalism from a set of physical as- 
sumptions, motivated by the belief that such an eluci- 
dation would contribute to the development of a clearer 
physical understanding of quantum theory, and might 
contribute to the development of a theory of quantum 
gravity [3,111 1 1. 

The interest in reconstructing the quantum formalism 
in this way has been encouraged by belief in the hypoth- 
esis that the concept of information might be the key, 
hitherto missing, ingredient which, if appropriately con- 
ceptualized and formalized, might make such a recon- 
struction possible. In recent years, several attempts have 
been made to systematically explore the reconstruction 
of the quantum formalism from an informational starting 
point , for example 0, S, 0, I, & El, El El El, El EES • 
Perhaps the earliest such work, due to Wootters Q, is 
remarkable for showing that one can derive a correct, 
non-trivial physical prediction (Malus' law) concerning a 
quantum experiment from a simple information-theoretic 
principle that concerns the amount of information ob- 
tained when measurements are performed upon a sys- 
tem, thereby providing clear support to the idea that the 
quantum formalism owes at least part of its mathemat- 
ical structure to quantitative measures of information. 
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However, the attempt to extend these principles in the 
direction of the quantum formalism meets with limited 
success. Other attempts to quantify the gain of infor- 
mation and to formulate information-theoretic principles 
have also been made in 

I, IEEE!, but they are sim- 
ilarly restricted in their sco pe. More recently, a number 
of approaches @, El, El, El, EE El 

succeed in deriving a 
significant portion of the abstract formalism of quantum 
theory, but at the cost of abstract assumptions (typi- 
cally involving the assumption of the complex number 
field) that play a key role [11], which significantly de- 
tracts from the understanding of the physical origin of 
the quantum formalism that they can provide. The ap- 
proach presented here suggests that it is possible to derive 
a large part of the quantum formalism without recourse 
to abstract assumptions of this nature [34[ . 

One way to understand the reason for thinking that 
information might have a fundamental role to play in 
our understanding of the physical world is as follows. 
Suppose that an experimenter is presented with a sys- 
tem prepared in a way unknown to her. In classical 
physics, she can, in principle, perform an ideal measure- 
ment upon the system which provides complete informa- 
tion about the state of the system. We shall describe 
this situation as one of transparency: intuitively, there is 
no insurmountable barrier between the system and the 
observer, so that, in principle, there is no fundamental 
distinction between the state of the system (the theoret- 
ical description of the reality) on the one hand, and the 
experimenter's information about the state on the other. 
On the other hand, when the same situation is described 
using quantum physics, an ideal measurement performed 
on the system provides the experimenter with only par- 
tial information about the state of the quantum system. 
Hence, transparency no longer holds. Now, one might 
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imagine that the experimenter could bypass this restric- 
tion by performing a large number of measurements on 
an ensemble of identically-prepared systems. However, if 
one assumes that an experimenter can, in principle, only 
perform a finite number of such measurements in a finite 
time, then, since the information that can be obtained by 
a finite number of measurements still only provides par- 
tial information, it follows that a fundamental distinction 
is drawn between the state of the system and the infor- 
mation that the experimenter can possibly have about 
the state. 

It is then natural to ask how the information that the 
experimenter gains can be quantified, and it seems rea- 
sonable to suppose that the information gain, and the 
manner in which this information is theoretically repre- 
sented and manipulated, might be constrained by prin- 
ciples which give rise to non-trivial mathematical struc- 
ture. This is the line of thinking we attempt to develop 
in this paper. 

In the quantum description of the above situation, the 
limitation imposed on the information that the experi- 
menter obtains can be traced directly to the fact that, 
in the quantum case, the state of the system only de- 
termines the probabilities of the results of a measure- 
ment performed upon it, a property we will refer to 
as statistical determination. As a consequence, a fi- 
nite number of runs of an experiment where a measure- 
ment is performed on a system prepared in some un- 
known state only yields limited information about the 
state. For example, if the system is in the state v = 
(y/pTe^ 1 , v /p2~e (fe , ■ ■ ■ , v / PNe 1<t ' N ) T in the basis of a pro- 
jective measurement, A, where the pi are the measure- 
ment probabilities and the 4>i are phases, then n runs of 
an experiment where measurement A is performed yield 
frequencies f±, fa, . . . , /jv, with /j being the frequency of 
result i, which provide only a finite amount of informa- 
tion about the pi. 

In order to focus upon statistical determinacy and 
quantitatively explore its consequences, it is helpful iso- 
late this feature, and to consider the process of informa- 
tion gain in the context of classical probability theory. 
Consider the following simple situation. Alice has two 
coins, A and B, characterized by the probability distri- 
butions p = (^1,^2) and p' = (fijPa)' respectively. Sup- 
pose that she chooses coin A, tosses it n times, and then 
sends the data to Bob, without disclosing which coin she 
chose. If Bob knows p and p', how much information 
does the data provide him about which coin was tossed? 

Using Bayes' theorem and Stirling's approximation 
for the case where n is large, on the assumption that 
coins A and B are a priori equally likely to be cho- 
sen, one finds that Pa/Pb = exp (nY%=iPifo(Pi/p'i)\ 
where Pa is the probability that the tossed coin is A 
given the data, and likewise for Pg. When the proba- 
bility distributions are close, so that p' = p + dp, the 
argument of the exponent can be expanded in the dpi to 
give Pa/Pb = exp(2nds 2 ), where ds 2 = \ J2i dPi/Pi- 

Now, the information gained by Bob, AI, is defined 



as AI = [7(1/2,1/2) - U{P A ,P B ), with U being an 
entropy (uncertainty) function such as the Shannon en- 
tropy. But, since Pa + Pb = 1 and Pa/Pb is determined 
by ds, once U is selected, AI is determined by ds. This 
result immediately generalizes to the case where p and p' 
are Af-dimensional probability distributions [M > 2). 
Hence, from an informational viewpoint, it is natural to 
endow the space of discrete probability distributions with 
a Riemannian metric, ds 2 = jJ2i^Pi/Pi- This metric, 
known as the information metric, is the central compo- 
nent of information geometry [l7j . 

In the course of this paper and a companion pa- 
per [IU (hereafter referred to as Paper II), we show that, 
by formalizing some elementary features of quantum phe- 
nomena (particularly complementarity and global gauge 
invariance) within this information-geometric framework, 
and supplementing these with a few additional plausible 
assumptions, it is possible to reconstruct the quantum 
formalism. In particular, we derive: 

1. The finite-dimensional abstract quantum formal- 
ism, namely (a) the von Neumann postulates 
for finite-dimensional systems, and (b) the tensor 
product rule for expressing the state of a composite 
system in terms of the states of its subsystems. 

2. The result due to Wigner that a one-to-one map 
over the state space of a system which represents 
a symmetry transformation of the system is either 
unitary or antiunitary [l9l |. 

3. The principal correspondence rules of quantum the- 
ory [351 ]. such as the canonical commutation rela- 
tion [x, pj = ih. 

In addition, we obtain the unitarily- and antiunitarily- 
invariant metric ds 2 = |c?v| 2 , and associated measure, 
over the space of pure states. 

The present paper is organized as follows. We begin 
in Sec. HI by providing a brief overview of the main ideas 
and the main steps in the derivation. 

In Sec. lIII Al we formulate a set of background assump- 
tions and idealizations, and an experimental framework. 
The experimental framework precisely delineates which 
experimental set-ups can be described by the theoretical 
model that is to be developed. By proceeding in this 
manner, we ensure at the outset that there is no ambigu- 
ity as to what set of experimental set-ups we are seeking 
to model. In Sec. IIII Bl we present a set of postulates 
which determine the theoretical model of a system in the 
context of this experimental framework. 

In Sec. IIV1 it is shown how the majority of these pos- 
tulates can be regarded as reasonable generalizations of 
elementary experimental facts that are characteristic of 
quantum phenomena, or arc drawn from classical physics, 
and so can be regarded as a reasonable starting point for 
a reconstruction of quantum theory. 

In Sec. El we show how these postulates give rise to the 
finite-dimensional abstract quantum formalism (apart 
from the form of the temporal evolution operator) and 
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Wigner's result, as well to a metric and measure over 
the space of pure states. We conclude in Sec. [Vl] with a 
discussion of the results. 

In Paper II, we formulate an additional principle (the 
Average- Value Correspondence Principle), and use this 
to obtain the form of the temporal evolution operator and 
to obtain the correspondence rules of quantum theory. 



II. OUTLINE OF THE DERIVATION 

The outline below is presented in four parts. First, 
we express the notion of complementarity, which con- 
sists of the idea that when a measurement is performed 
upon a system in some state, the measurement result 
only yields information about half of the experimentally- 
accessible degrees of freedom of the state. Complemen- 
tarity is expressed by the assertion that, in the case of a 
measurement that yields N possible results, each result 
coarse-grains over two objectively-realized outcomes of 
the measurement. The outcomes are assumed to be sta- 
tistically determined by the state of the system, and are 
thus characterized by a probability distribution. The as- 
sumed statistical determinism of the outcomes motivates 
the imposition of the information metric over the space 
of probability distributions. Using these two ideas, we 
obtain a formalism in which states of a system are repre- 
sented by unit vectors in a 2/V-dimensional real Euclidean 
space, and physical transformations are represented by 
orthogonal transformations of these vectors. 

Second, the state is expressed in terms of the prob- 
abilities of the measurement results and a set of N real 
variables, xi> ■ ■ ■ > Xn- The assertion that the transforma- 
tion Xi —> Xi+Xo is, for any xo, is a global gauge transfor- 
mation of the Xi implies that the resulting formalism can 
be expressed in complex form, such that states are repre- 
sented by TV-dimensional complex vectors, and physical 
transformations are represented by unitary or antiunitary 
transformations. 

Third, it is asserted that any measurement describ- 
able by the formalism can be simulated in terms of 
any given measurement flanked by suitable interactions, 
which yields the Born rule and the Hermitian represen- 
tation of measurements. 

Fourth, we assume that, in the special case of a system 
of definite energy in stationary state whose observable de- 
grees of freedom are time-independent, the temporal rate 
of change of the overall phase of the state is proportional 
to the energy of the system. In conjunction with the 
global gauge invariance condition, this assumption yields 
the tensor product rule. 

Finally, we indicate the further steps that are neces- 
sary to derive the explicit form of the temporal evolution 
operator, and the correspondence rules of quantum the- 
ory. 



Part 1: Constructing State Space 

Measurement is idealized as a process that (i) when 
performed upon some physical system, yields one of N 
possible results, with probabilities, pi, .. . ,pjv, that are 
determined by the state of the system immediately prior 
to the measurement, and (ii) is reproducible, so that, 
upon immediate repetition of the measurement, the same 
result is obtained with certainty. 

1. Formalizing Complementarity 

Complementarity is incorporated by assuming that, 
when measurement A is performed, there are, in fact, 
2N possible outcomes, but these outcomes are not indi- 
vidually observed. Result i is observed (i = l,...,N) 
whenever either outcome 2i — 1 or outcome 2i is real- 
ized (see Fig.[I|. The probabilities of outcomes 1, . . . , 2N 
are given by P 1 ,...,P 2N , so that p { = P 2i -\ + Pii- 
The P q (q = 1, . . . ,2N) are summarized by the proba- 
bility n-tuple P = (Pi, . . . , P 2N ). 




FIG. 1: Complementarity Postulate. Probability tree show- 
ing the outcomes of measurement A, and the corresponding 
results (circled) that are observed. One of 2N possible out- 
comes are realized, with probability P\, P2, . . . , P2N, respec- 
tively. The measurement is unable to resolve the individual 
outcomes. Result i is obtained whenever either outcome 2i — 1 
or outcome 2i is realized (i = 1, . . . , N). 

Intuitively, performing the measurement brings about 
the realization of one of 27V possible outcomes, but the 
results of the measurement coarse-grain over these out- 
comes: when outcomes 2i — 1 or 2i is realized, the mea- 
surement is (for some reason to be investigated) unable 
to resolve the individual outcomes, so that only result i 
is registered. 

2. Imposing the Information Metric 

Once one assumes that measurement outcomes are 
only statistically determined by the state of the system, it 
follows that it is impossible in general to determine with 
certainty whether a given system is in one of two possible 
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states on the basis of the outcomes of a finite number of 
measurements performed upon identical copies of the sys- 
tem. Instead, only a finite amount of information about 
which state is present can be obtained. Consequently, as 
discussed in the Introduction, it is natural to endow the 
space of the P with the information metric, 

ds2 = -J2 dp2 J p «- a) 

<? 

It is convenient to define Q q = \fP~ qi since the metric 
over the Q q is then simply the Euclidean metric, 

ds 2 =dQf + --- + dQt N . (2) 

so that Q is a unit vector that lies on the positive orthant 
of the unit hypersphere, 5 2Af_1 , in a 27V-dimensional real 
Euclidean space. 

3. Representing Physical Transformations 

Next, we consider transformations which represent 
physical transformations of the system. We seek one-to- 
one transformations that preserve the information met- 
ric over P, thereby endowing the information metric 
with a fundamental physical significance. Now, if one 
takes the Q themselves as the state space of the system, 
one finds that non-trivial one-to-one transformations of 
the state space that preserve the Euclidean metric are 
not possible. A simple way to allow the existence of 
such transformations is to take the entire unit hyper- 
sphere, .S 2iV_1 , as the state space of the system. Thus, 
we shall postulate that a state of the system is given by 
a unit vector Q = (Qi, Q2, ■ ■ ■ , Q2N), with Q q e [— 1, 1], 
and that the probabilities P q are given by P q — Q 2 . From 
the information metric over the P, it follows that the 
metric over the Q is Euclidean. Hence, Q lies on the 
unit hypersphere, 5 2JV_1 , in a 2iV-dimensional real Eu- 
clidean space. 

The signs of the Q q determine in which of the 2 2N 
possible orthants of 5 ,2Af_1 that Q lies. We shall refer 
to the sign of Q q as the polarity of outcome q, which 
is defined only when P q ^ 0, and is physically realized 
whenever outcome q is realized, but is unobserved. 

We postulate that any transformation, jVf, of S 2N ~ l , 
that represents a physical transformation of the system is 
one-to-one and preserves the metric. It follows that M. is 
an orthogonal transformation of S 21 ^^ 1 , so that state Q is 
transformed to Q' = A/Q, where M is a 27V-dimensional 
real orthogonal matrix. 

Part 2: Global Gauge Invariance 

We begin by expressing the state, Q, in terms of the 
probabilities pi,p2, ■ ■ ■ iPn, and N additional real degrees 
of freedom, XI1X2, ■ • ■ ,XN- In particular, we postulate 



that the pi and Xi together determine the Q q through 
the relations Q 2 i-i = ■ s /pif{Xi) and Qii = y/Pif(Xi)> 
where / is not a constant function, and / and / are 
differentiable functions to be determined. 

This change of variables can be understood more di- 
rectly as follows. Let the outcomes 1,2,..., 2N be re- 
labeled as la, 16, 2a, 26, ... , Nb, respectively, where the 
number (1, . . . , N) indicates the observed result and the 
letter (a or 6) indicates which of the two possible out- 
comes compatible with the observed result was realized. 
It then follows from the above assumption that, given 
result i is observed, the probability that outcome la was 
realized is p a u — / 2 (Xi) and, similarly, the probability 
that outcome 16 was realized is pm — f 2 (xi)- The polar- 
ities (when defined) associated with outcomes ia and ib 
are given by the signs of f(Xi) and f(Xi), respectively (see 
Fig.©. 




FIG. 2: State Representation Postulate. Probability tree 
showing the outcomes (relabeled as la, 16, ... , Nb) of mea- 
surement A, redrawn to place results (circled) at the top. Re- 
sult i is observed with probability pi. Given that i is observed, 
outcome la or 16 is realized with probability p a u = / 2 (x0 
or Pb\i = f 2 (Xi)y respectively. The polarities (when denned) 
associated with outcomes ia and ib are given by the signs 
of f(Xi) an d f(Xi)i respectively. The individual outcomes ia 
and ib, and their polarities, are unresolved by the measure- 
ment. 



4- Formalizing Global Gauge Invariance 

On the basis of a classical-quantum correspondence 
argument, we are led to the assumption that, for 
the model of a system where the state as represented 
by (pi!Xi)j the transformation Xi ~* Xi + Xo f° r * = 
1, . . . , N is a gauge transformation, and therefore causes 
no change in the predictions of the model. From 
this assumption, we immediately draw two postulates. 
First, that the measure [i{jPi", Xi) ■> induced by the metric 
over S 21 ^^ 1 is consistent with this global gauge condition, 
that is n(p 1 ,...,p N ;xi,---,XN) = • • • ,Pn; Xi + 
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Xo, ■ • ■ , Xn + Xo) for all xo ; which we find implies that 

/(Xi) = cos(aXt + 6) 
fiXi) = sin(ax l + 6), 

where d^O and 6 are constants, so that 

Q = (v^iCos^i.-y/PiSin^i,..., ^ N sin(p N ), (4) 

where 0j = a\i + b. 

Second, we postulate that the result probabilities cal- 
culated from state Q' = MQ are unaffected by the 
above gauge transformation of the Xi i n Q ( sce Fig- EJ • 
Remarkably, one finds that every orthogonal transforma- 
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measurement A which produces a measurement result 
and a system in some outgoing state, and (c) some in- 
teraction, I', represented by a unitary transformation, V, 
then acts upon the system to generate a system in some 
output state. From the reproducibility of A and A', one 




Input 
System 



FIG. 4: Measurement Simulability Postulate. An arrangement 
showing measurement A flanked by interactions I and I'. 
The arrangement simulates measurement A' insofar as its 
measurement probabilities and corresponding possible output 
states are concerned. 



immediately obtains the Born rule. 



FIG. 3: Gauge Invariance Postulate. Along the top, a system 
in a state represented by (p%,Xi) evolves under map M to 
state represented by (p'i;x'i)- Along the bottom, a system in 
a state represented by (p;; Xi + Xo) evolves under map A4 to 
a state represented (p";xH- The postulate asserts that, for 
any xo, one has p\ = p" for i = 1, . . . , N. 

tion, M, of Q that satisfies this postulate can be rewrit- 
ten as a unique unitary or antiunitary transformation of 

v = (Qi + 1Q2, ■ ■ ■ ,Qin-i + iQinY ■ (5) 

Conversely, one finds that every unitary or antiunitary 
transformation of v is equivalent to a unique orthogonal 
transformation of Q which satisfies the global invariance 
postulate. That is, the set of all orthogonal transforma- 
tions of 5' 2Ar ~ 1 is in one-to-one correspondence with the 
set of all unitary and antiunitary transformations of the 
space of unit vectors v. Hence, any physical transfor- 
mation of the system in state v can be represented by a 
unitary or antiunitary transformation of v. 



Part 4: Composite Systems 

On the basis of the above-mentioned classical-quantum 
correspondence argument, we are led to the assumption 
that the state of a system of definite energy, E, whose 
observable degrees of freedom are time-independent, and 
in a time-independent background, evolves as {jPi\Xi) ~ > 
(Pii Xi~ EAt/a) during the interval [t, t + At], where a ^ 
is a constant. 

Using this assumption and the above-mentioned global 
gauge condition, we obtain that, if a system consists of 
two subsystems in states (pi',Xi) an d (p'j',x'j): respec- 
tively, where i = 1 , . . . , TV and j = 1 , . . . , N , then the 
system has state (p";x") = ip%p'j]Xi + Xj), where I = 
N'(i — 1) + j. The tensor product rule follows immedi- 
ately. 



Summary of Further Steps 



Part 3: Representation of Measurements 

Generalizing from the fact that one can simulate any 
Stern-Gerlach measurement in terms any given Stern- 
Gerlach measurement flanked by suitable magnetic fields, 
we postulate that any reproducible measurement, A', de- 
scribable in the formalism can be simulated by an ar- 
rangement (see Fig.U]) where (a) an interaction, I, repre- 
sented by a unitary transformation, U, acts on a system 
in some input state, (b) the resulting system undergoes 



In Paper II, a second classical-quantum correspon- 
dence argument is given that leads to the Average- Value 
Correspondence Principle which, in conjunction with the 
above assumption concerning the temporal evolution of 
a system, yields the explicit form of the unitary oper- 
ator, Ut(dt) — exp(—iHdt/H), that represents temporal 
evolution of a system during the interval [t, t+dt] in terms 
of the Hamiltonian operator, H. Using the Average- Value 
Correspondence Principle, we then obtain the principal 
correspondence rules of quantum theory. 
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III. EXPERIMENTAL SET-UP AND 
POSTULATES 



A. Abstract Experimental Set-up 

A model of a physical system is a theoretical structure 
that allows one to make predictions about the behav- 
ior of the system in one of a set of possible experimen- 
tal arrangements, where an experimental arrangement is 
abstracted as consisting of a preparation of the system, 
followed by an interaction and a measurement. This set 
of experimental arrangements, which we shall refer to 
as an experimental set, is a subset of all conceivable ex- 
perimental arrangements in which the system could be 
placed. 

Since the primary goal of the present work is to il- 
luminate the physical origin of the quantum formalism, 
it is important to clearly delineate, at the outset, the 
experimental set to which the theoretical model, to be 
developed, is intended to apply. Below, we develop an 
operational procedure to delineate the experimental set, 
which is based on the following ideas. 

Consider an experimental arrangement where, in each 
run, a system (taken from a source of identical systems) 
undergoes a preparation and an interaction, and is then 
subject to a measurement. Given that one's goal is to 
probe the behavior of the system, one ideally wishes to 
prepare the system in such a way that the data obtained 
from the measurement is independent of arbitrary inter- 
actions with the system prior to the preparation. In this 
way, one ensures that the measurement data are not in- 
fluenced by conditions which are not under experimental 
control. We shall say that experimental set-ups of this 
kind are closed (or have the property of closure), and 
we shall restrict our consideration to such set-ups. Using 
the concept of closure, we can then give a systematic pro- 
cedure for generating the set of all closed experimental 
arrangements which, roughly speaking, probe the same 
behavioral aspect of a system as does some given measure- 
ment A. Such a set will be said to be an experimental 
set generated by measurement A. 

For example, any experimental arrangement where sil- 
ver atoms are subject to preparation by a Stern-Gcrlach 
device, undergo an interaction with a uniform magnetic 
field, and finally undergo a Stern-Gerlach measurement, 
is closed in the above sense. Furthermore, the set of all 
such arrangements is an experimental set generated by 
any given Stern-Gerlach measurement. All such set-ups 
probe the same behavioral aspect of the system, namely 
its spin behavior. 

Before we can precisely define closure and give a pro- 
cedure for generating an experimental set, it is necessary 
to formulate a number of fundamental background as- 
sumptions and idealizations, to which we now turn. 



1. Background assumptions and Idealizations 

The theoretical framework of classical physics makes 
the following key background assumptions: 

(a) Partitioning. The universe is partitioned into a 
system, the background environment (or simply, 
the background) [361 ] of the system, measuring ap- 
paratuses, and the rest of the universe. 

(b) Time. In a given frame of reference, one can speak 
of a physical time which is common to the system 
and its background, and which is represented by a 
real-valued parameter, t. 

(c) States. At any time, the system is in a definite 
physical state, whose mathematical description is 
called the mathematical state, or simply the state, 
of the system. The state space of the system is the 
set of all possible states of the system. 

Since these assumptions are not in obvious conflict with 
quantum phenomena, we adopt them unchanged. The 
classical framework also makes two key idealizations con- 
cerning measurements: 

Al Operational Determinism. The result of a mea- 
surement performed on a system is determined by 
experimentally-controllable variables. 

A2 Continuum. The values of the possible results of a 
measurement form a real- valued continuum. 

However, the experimental investigation of elementary 
quantum phenomena, such as Stern-Gerlach measure- 
ments performed on silver atoms, leads to the following 
reasonable modification of these idealizations: 

Al' Statistical Operational Determinism. The data 
obtained when a measurement is performed on 
a system are best modeled by a probabilistic 
source (37j whose probabilities are determined by 
experimentally-controllable variables. 

A2' Finiteness. A measurement performed on a system 
has a finite number of possible results. 

Using these assumptions as a basis, the general ab- 
stract experimental set-up that we shall consider is shown 
in Fig. A source provides identical copies of a physi- 
cal system of interest. A preparation step either selects 
or rejects the incoming system. In a particular run of 
the experiment, a physical system from the source passes 
the preparation, and is then subject to a measurement 
or measurements. 

A measurement is idealized as a process that acts upon 
an input system, and generates an output system to- 
gether with an observed result. The measurement de- 
tectors are assumed not to absorb the systems that they 
detect. Accordingly, a preparation consists of a mea- 
surement that determines to which result the incoming 
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system belongs, followed by the selection of the system if 
the measurement registers a give n result, and the rejec- 
tion of the system otherwise [38|. In addition, following 
the preparation, the system may undergo an interaction 
with a physical apparatus. 

We shall only consider set-ups which satisfy particular 
idealizations. In particular, we shall restrict considera- 
tion to measurements that have the following properties: 

• Distinctness. The possible results of a measure- 
ment have distinct values. 

• Reproducibility. When a measurement is immedi- 
ately repeated, the same result is observed with 
certainty. 

• Classicality. The measurements do not involve aux- 
iliary quantum systems. 

The distinctness assumption excludes, at this stage, 
the consideration of measurements where different re- 
sults cannot be observationally distinguished 39]. The 
assumption of reproducibility is drawn from classical 
physics and is adopted unchanged since it is also a reason- 
able idealization in many quantum experiments. Finally, 
the assumption of classicality ensures that the relevant 
degrees of freedom of the measurement device itself can 
be treated classically (ipj . 

In addition, we shall restrict consideration to interac- 
tions that have the following properties: 

• Integrity-preserving. The interactions preserve the 
integrity of the system. 

• Reversible and Deterministic. The interactions are 
reversible and deterministic at the level of the state 
of the system, and so can be represented as one-to- 
one maps over state space. 

The integrity-preserving assumption ensures that a theo- 
retical model can follow a system during the interaction. 
The reversible and deterministic assumption is drawn 
from classical physics, and is adopted unchanged. 

We shall also assume that the background of the sys- 
tem can be adequately modeled by classical physics in- 
sofar as its internal dynamics is concerned. For example, 
in the case of a system in a background electromagnetic 
field, the field is assumed to be modeled classically. Sim- 
ilarly, we shall assume that parameters which determine 
the measurement being performed (the orientation of a 
Stern-Gerlach apparatus, for instance) are described clas- 
sically as real numbers. In short, it is assumed that the 
non-classicality is entirely concentrated in the system and 
in its interactions with the background and the measure- 
ment devices. 

2. Closed set-ups in the quantum framework 

From assumptions Al' and A2', it follows that, in a 
given experimental set-up, the measurement data ob- 
tained in repeated trials are theoretically characterized 



by a finite set of probabilities. Therefore, the closure 
condition defined earlier, when applied in the context of 
these assumptions, requires that these probabilities are 
independent of the pre-preparation history of the system. 



3. Experimental sets 

The measurements employed in the abstract set-up are 
chosen from a measurement set, A, which we shall de- 
fine below. As mentioned previously, it will be assumed 
that each measurement has the property of finiteness, 
which we shall now operationalize by saying that, when 
the measurement is carried out on a system which has 
been emitted from the source and has undergone arbi- 
trary interactions thereafter [411 ] . the measurement gen- 
erates one of a finite number of possible results, a possible 
result being defined as one that has a non-zero probabil- 
ity of occurrence. 

Consider now an experiment (Fig. [5]) in which a sys- 
tem from a source is subject to a preparation consist- 
ing of measurement, A', with Na> possible results, with 
result j selected (j = 1 , . . . , Na' ) , followed by measure- 
ment A (with Na possible results), without an interac- 
tion in the intervening time. 

Suppose that the data obtained in the experiment 
are characterized by a probabilistic source with Na 
possible results and with probability n-tuple p — 
(pi,P2, ■ ■ ■ ,Pn a ), where pi is the probability of the ith 
result (i = 1,2,..., Na). If, for all j, p is independent 
of arbitrary pre-preparation interactions with the sys- 
tem, the preparation will be said to be complete with 
respect to measurement A. If this completeness condi- 
tion also holds true when A and A' arc interchanged, 
then A and A' will be said to form a measurement pair. 

The set of measurements generated by A forms a mea- 
surement set, A, which is defined as the set of all mea- 
surements that (i) form a measurement pair with A and 
that (ii) are not a composite of other measurements in A. 
An important corollary of this definition is that two mea- 
surement sets are either identical or disjoint. 

Interactions following the preparation step are chosen 
from an interaction set, X, which is defined as follows. 
Suppose that, in the experiment of Fig. [5J an interac- 
tion, I, occurs between the preparation and measure- 
ment. If, for all A, A' G A, the preparation remains 
complete with respect to the subsequent measurement, 
then I will be said to be compatible with A. The set X is 
then defined as the set of all such compatible interactions. 

In terms of these definitions, a closed set-up consists of 
a source of systems where each system is prepared using 
a measurement A' £ A, is then subject to an interac- 
tion I € I, and then undergoes a measurement A e A, 
where A is generated by A, and X is the set of all interac- 
tions compatible with A. The set of all such set-ups will 
be taken as constituting an experimental set, and will be 
said to be generated by measurement A. 

Finally, if there are two experimental set-ups, each 
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FIG. 5: An abstract experimental set-up. In each run of the experiment, a physical system (such as a silver atom) is emitted 
from a source, passes a preparation step, and is then subject to a measurement. An interaction, I, may occur as indicated 
between the preparation and measurement. The preparation is implemented as a measurement, A', which has Na< possible 
results, followed by the selection of those systems which yield some result j (j = 1, 2, . . . , N^')- The measurement, A, has Na 
possible results. 



with a source containing identical copies of the same 
physical system, constructed using measurements from 
measurement sets A^ and A^ 2 \ respectively, then, if 
the measurement sets are disjoint, then the two set-ups 
will be said to be disjoint. 



4- An example 

To illustrate the above definitions, consider an experi- 
ment where silver atoms emerge from a source (an evap- 
orator), pass through a Stern-Gerlach preparation de- 
vice, undergo an interaction, and finally undergo a Stern- 
Gerlach measurement. In this case, one finds experimen- 
tally that the set, A, generated by any Stern-Gerlach 
measurement consists of all Stern-Gerlach measurements 
of the form Ag^, where (9,(f>) is the orientation of the 
Stern-Gerlach device. Measurements that are composed 
of two or more Stern-Gerlach measurements are excluded 
from A by definition. 

Consider now an interaction, le Bt< p B ,t,At, consisting of 
a uniform B-ficld acting during the interval [t, t + At] in 
some direction {6b,4>b)- If such an interaction occurs 
between the preparation and measurement, one finds ex- 
perimentally that the completeness of the preparation 
with respect to the measurement is preserved; that is, 
the interaction is compatible with A. Hence, all inter- 
actions in which a uniform magnetic field acts between 
the preparation and measurement are in the interaction 
set, I. 

The experimental set in this case consists of all set- 
ups consisting of a Stern-Gerlach preparation, an inter- 
action with a uniform magnetic field, followed by a Stern- 
Gerlach measurement. 

Finally, to illustrate the concept of disjoint set-ups, 
consider a source which emits a system consisting of two 
distinguishable spin- 1/2 particles on each run of an ex- 
periment, and consider two set-ups where the first set-up 
is constructed using measurement set ^l*- 1 ) consisting of 
all possible Stern-Gerlach measurements performed on 



one of the particles, and the second is constructed using 
a measurement set A^ consisting of all possible Stern- 
Gerlach measurements performed on the other particle. 
In this case, according to the above definitions, it is found 
that the two measurement sets are disjoint. The set-ups 
themselves are accordingly said to be disjoint. 



5. Some remarks 

The above definitions can be understood intuitively as 
follows. If the measurements A and A' form a mea- 
surement pair, then they can be regarded as probing the 
same behavioral aspect of a system. For example, if mea- 
surements A and A' are Stern-Gerlach measurements, in 
which case A and A' form a measurement pair, both are 
probing the spin behavior of a system. A measurement 
set can then be understood as consisting of all measure- 
ments that probe a given behavioral aspect of a system. 

An interaction which is compatible with a measure- 
ment set can, similarly, be understood as one that does 
not allow the behavioral aspect that is being probed to 
be influenced by degrees of freedom that belong to some 
other behavioral aspect. For example, the effect on a sil- 
ver atom of a uniform magnetic field is only dependent 
upon the spin degrees of freedom of the system, and is 
therefore compatible with a measurement set consisting 
of Stern-Gerlach measurements. However, an interaction 
that affects the spin degrees of freedom but is dependent 
upon the spatial degrees of freedom of the system is not 
compatible with this measurement set. 

Finally, if two set-ups are disjoint, they are probing dis- 
tinct behavioral aspects of the same physical system. For 
example, a system may, in one set-up, be subject to mea- 
surements from a measurement set consisting of Stern- 
Gerlach measurements, and, in another set-up, to mea- 
surements from a measurement set that probe the spatial 
behavior of the system. As we would expect from clas- 
sical physics, these measurement sets are disjoint, which 
can be understood as reflecting the fact that these set- 
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ups are examining disjoint behavioral aspects (spin and 
spatial behavior) of the system. 

B. Statement of the Postulates 

Consider the idealized experiment illustrated in Fig. O 
in which a system passes a preparation step that employs 
a measurement A' in measurement set A, undergoes an 
interaction, I in the interaction set I, and is then subject 
to a measurement, A G A, where A is generated by A, 
and X is the set of all interactions compatible with A. 
The abstract theoretical model that describes this set-up 
satisfies the following postulates. 

1. Measurements 

1.1 Results. When any measurement A G A is 
performed, one of N (N > 2) possible results 
are observed. 

1.2 Measurement Simulability. For any given pair 
of measurements A, A' G A, there exist inter- 
actions I, I' el such that A' can, insofar as 
probabilities of the observed results and inso- 
far as the possible output states of the mea- 
surement are concerned, be simulated by an 
arrangement where I is immediately followed 
by A which, in turn, is immediately followed 
by I' (see Fig.EJ). 

2. States 

2.1 Complementarity. When any given measure- 
ment A G A is performed on the system, one 
of 2N possible outcomes is realized with prob- 
ability Pi, ... , P2N, respectively. The indi- 
vidual outcomes are unobserved. Result i is 
observed whenever outcome (2i — 1) or out- 
come 2i is realized (see Fig. [TJ. 

2.2 States. The state, S, of the system with re- 
spect to measurement A G A is given by Q = 
(Qi,Q 2 , ■ • • ,Q2n) T , where Q q G [-1,1], q = 
1,...,2N. The probability of outcome q is 
given by P q — Q q , and the variable, a q = 
sign(Qg), which is defined if P q ^ 0, is a 
binary degree of freedom (a polarity) associ- 
ated with outcome q and is physically realized 
whenever outcome q is realized, but is unob- 
served. 

2.3 State Representation. The state, S, 
of a system with respect to measure- 
ment A can be represented by the 
pair (p,x) where p = (pi,...,p N ) 
and x — {Xii ■ ■ ■ jXn) are real n-tuples, 
and where pi is the probability that the zth 
result of measurement A is observed. In 
particular, the state is given by Q = 

(VpT<7a|i, \fpiq.b\\> ■ * ■ 5 ^/PNq a \N, ^/PNqb\N), 



where q a \i = f(xi) and = f(xi), where / 
is not a constant function, and where / and / 
are differentiable functions (see Fig.[2|). 

2.4 Information Metric. The metric over P = 
(Pi, P2, . . . , P2N-1, P2N) is the information 
metric, ds* = ± £"=1 dP*/P*. 

2.5 Measure Invariance. The mea- 
sure, n(pi,...,p N ;Xi,---,XN), over p,x 
induced by the metric over Q satisfies 
the condition (j,(pi, . . . ,p^\ xx, ■ ■ ■ , Xn) = 
H<J>i, ■ ■ ■ ,Pn; Xi + Xo, ■ ■ ■ , Xn + Xo) for all xo- 

3. Transformations 

3.1 Mappings. Any physical transformation of the 
system, whether active (due to temporal evo- 
lution) or passive (due to a change of frame of 
reference), is represented by a map, M., over 
the state space, S, of the system. 

3.2 One-to-one. Every map, A4, that represents a 
physical transformation of the system is one- 
to-one. 

3.3 Continuity. If a physical transformation is 
continuously dependent upon the real-valued 
parameter n-tuple 7T, and is represented by the 
map -M.,r, then M n is continuously dependent 
upon it. 

3.4 Continuous Transformations. If repre- 
sents a continuous transformation, then, for 
some value of 7r, M.^ reduces to the identity. 

3.5 Metric Preservation. The map M. preserves 
the metric over the state space, S, of the sys- 
tem. 

3.6 Gauge Invariance. The map M. is such 
that, for any state S G S, the probabili- 
ties, p'liv'i-, ■ ■ ■ iPjvj °f the results of measure- 
ment A G A performed upon a system in 
state S' = A^(S) are unaffected if, in any rep- 
resentation, (pilXi) = (PiX)) °f t ne state S 
written down with respect to A, an arbi- 
trary real constant, ^o, is added to each of 
the Xi (see Fig. [3]). 

3.7 Temporal Evolution. The map, Mt.At, which 
represents temporal evolution of a system in a 
time-independent background during the in- 
terval [t,t + At], is such that any state, S, 
represented as (piJXi), of definite energy E, 
whose observable degrees of freedom are time- 
independent, evolves to (p[;Xi) — (Pi'iXi ~~ 
EAt/a), where a is a non-zero constant with 
the dimensions of action. 

The above postulates, together with the Average- Value 
Correspondence Principle (AVCP), which will be given 
in Paper II, suffice to determine the form of the abstract 
quantum model for the abstract set-up. From the Results 
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postulate, it follows that, when any measurement in A 
is performed on the system, one of N possible results 
is observed. Accordingly, we shall denote the abstract 
quantum model of such a set-up by q(iV). 

Finally, we shall need the Composite Systems postu- 
late, below, in order to obtain a rule, which we shall refer 
to as the composite systems rule, for relating the quan- 
tum model of a composite system to the quantum models 
of its component systems: 

4. Composite Systems. Suppose that a system ad- 
mits a quantum model with respect to the measure- 
ment set A^ whose measurements have N pos- 
sible results, and admits a quantum model with 
respect to measurement set A^ whose measure- 
ments have N' possible results, where the sets A^ 
and A^ are disjoint. 

Consider the quantum model of the system with 
respect to the measurement set A = A^ x A^ 
that contains all possible composite measurements 
consisting of a measurement from A^ and a mea- 
surement from A^ . If the states of the subsys- 
tems can be represented as (p$; \i) {i = 1> 2, . . . , N) 
and (p'px'j) (i = 1,2, . . . ,7V'), respectively, then 
the state of the composite system can be repre- 
sented as {p'{; x'l) {1 = 1,2,..., NN 1 ), where p'{ = 
Pip'j and x'i =Xi + Xj, where I = N'(i - 1) + j. 



IV. PHYSICAL COMPREHENSIBILITY OF 
THE POSTULATES 

When formulating the postulates, our goal has been to 
maximize their physical comprchcnsibility. For the pur- 
poses of discussion, it is helpful to distinguish two levels 
of physical comprchcnsibility First, at the minimum, 
a comprehensible postulate is one that can be transpar- 
ently understood as a simple assertion about the physical 
world. If this is the case, we shall say that the postulate 
has the property of transparency. Second, a postulate has 
an additional level of comprehensibility if it can also be 
traced to well-established experimental facts and phys- 
ical ideas or principles, a property we shall refer to as 
traceability. 

To illustrate these ideas, consider the example of Ein- 
stein's postulate that the speed of light emitted by a 
source is independent of the speed of the source. The 
postulate can be transparently understood as a physical 
assertion in itself. In addition, the postulate can also 
be understood as a reasonable inference from the well- 
established results (namely, the constancy of the two-way 
speed of light) of the Michelson-Morley experiment, the 
generalization from the specific context of the experiment 
being achieved by an appeal to the general principle of 
the uniformity of nature. Hence, the postulate is both 
transparent and traceable. 

In our discussion below, we shall organize the postu- 
lates into three groups according to their origin: 



1. Based on Experimental Observations. Postu- 
lates obtained through direct generalization of ex- 
perimental facts that are taken to be characteristic 
of quantum phenomena. 

2. Drawn from Classical Physics. 

2.1 Postulates adopted unchanged from the theo- 
retical framework of classical physics. 

2.2 Postulates obtained from classical physics 
via a classical-quantum correspondence argu- 
ment. 

3. Novel Postulates. Postulates based on novel the- 
oretical principles or ideas which cannot obviously 
be traced to classical physics or to experimental 
observations. 

The first group of postulates are obtained by direct gen- 
eralization of elementary experimental facts, such as the 
statistical nature of experimental results, that can be rea- 
sonably taken as characteristic of quantum phenomena. 
Insofar as these postulates can be regarded as reason- 
able generalizations of experimental facts, they can be 
regarded as possessing transparency and traceability 

In formulating the second group of postulates, we rec- 
ognize that the assumptions underlying the theoretical 
framework of classical physics are transparent and trace- 
able to well-established experimental facts and theoreti- 
cal ideas, and remain fundamental to the way in which 
we conceptualize the physical world. Accordingly, we at- 
tempt to preserve these assumptions as far as possible 
in the face of quantum phenomena. In particular, some 
of the postulates are obtained by simply adopting fun- 
damental features of the classical theoretical framework, 
while the others are obtained by transposing particular 
features of the classical models of physical systems into 
the quantum realm via a classical-quantum correspon- 
dence argument. 

Finally, the third group of postulates consist of novel 
postulates, one physical postulate (the Complementarity 
postulate) and two information-geometric postulates (the 
Information Metric and the Metric Preservation postu- 
lates) . 

A. Postulates based upon experimental facts 

Postulate 1.1: Results 

Consider an experiment in which Stern-Gerlach prepa- 
rations and measurements are performed upon silver 
atoms, and where the set A consists of the elements Ag^ 
representing Stern-Gerlach measurements in the direc- 
tion (6*, 4>). In this experimental set-up, which is closed in 
the sense defined earlier, we find that each measurement 
yields one of two possible results. The Results postulate 
generalizes this finding by asserting that all the measure- 
ments in a measurement set have the same number, TV, 
of possible results. 
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Postulate 1.2: Measurement Simulability 

Consider again the above Stern-Gerlach experiment. 
In this experiment, if an interaction consisting of a uni- 
form magnetic field acts between the preparation and 
measurement, one finds that the probabilities of the mea- 
surement results are the same as those obtained if a dif- 
ferent Stern-Gerlach measurement is performed with the 
interaction absent. 

Using this observation, one finds that it is possible 
to simulate measurement Ag^ using any given measure- 
ment A £ A if followed immediately before and after by 
suitable interactions. The simulation behaves precisely 
as A-e,<t> insofar as the probabilities of measurement re- 
sults 1 and 2, and the corresponding output states, are 
concerned. Postulate 1.2 can be regarded as direct gen- 
eralization of this observation. 



B. Postulates adopted unchanged from classical 
physics 

A classical model of a physical system is based upon 
the partitioning, time and states background assump- 
tions given earlier, and these are adopted unchanged in 
the abstract quantum model. In addition to the assump- 
tions A\ and A2 concerning measurements given earlier, 
the classical model additionally makes the following key 
assumptions: 

B States. 

Bl Determinism. The state of the system and 
a theoretical description of a measurement 
that is performed on the system determine the 
measurement result. 

C Transformations. 

CI Mappings. Physical transformations of the 
system, either due to temporal evolution or 
due to a passive change of frame of reference, 
are represented by maps over the space of 
states. 

C2 One-to-one. The mappings are one-to-one. 

C3 Continuity. If a map represents a physi- 
cal transformation that depends continuously 
upon a real-valued set of parameters, then the 
map is continuously dependent upon these pa- 
rameters. 

C4 Continuous transformations. If a map rep- 
resents a continuous transformation (such as 
temporal evolution) that depends continu- 
ously upon a set of real- valued parameters, 
then, for some value of these parameters, the 
map reduces to the identity. 



We remark that, in asserting C1-C2, it is presupposed 
that physical transformations of a physical system are de- 
terministic and reversible, which prevents the description 
of irreversible or indeterministic transformations within 
the classical framework at a fundamental level. 

First, we consider those postulates which adopt classi- 
cal assumptions unchanged. The Mappings and One- 
to-one postulates respectively correspond to assump- 
tions CI and C2, while the Continuity and Continu- 
ous Transformations postulates correspond to assump- 
tions C3 and C4, respectively. 

Second, as described earlier, in light of the results of 
experiments involving quantum systems (such as Stern- 
Gerlach measurements on silver atoms), it is reason- 
able to modify assumptions Al, A2 to assumptions Al' 
and A2' (see Sec. MI A[) . and accordingly to modify Bl 
as follows: 

Bl' Statistical Determinism. The state of the system 
and a theoretical description of a measurement that 
is performed on the system only statistically deter- 
mine the measurement result. 

Assumption Bl' is incorporated within the Complemen- 
tarity postulate. 

C. Postulates obtained through classical-quantum 
correspondence 

A general guiding principle in building up a quantum 
model of a physical system is that, in an appropriate 
limit, the predictions of the quantum model of the system 
stand in some one-to-one correspondence with those of a 
classical model of the system. By establishing such a cor- 
respondence between the quantum and classical models 
of a particle, we shall transpose two elementary proper- 
ties of the classical model across to the quantum model 
and then, by generalization, transpose these properties 
across to the abstract quantum model, q(N). In this 
manner, we shall obtain a number of important postu- 
lates. 

The key idea, explicated in detail below, is that a 
quantum model of a particle moving in one dimension 
corresponds, in the appropriate classical limit, to the 
classical ensemble model of the particle, namely the 
Hamilton- Jacobi ensemble model. We consider coarse 
position measurements, and discretize the Hamilton- 
Jacobi state (P(r,t), S(f,t)), where P(f,t) is the prob- 
ability density function over position and S(f, t) is the 

action function, as (p| CM ' ) ; Si), where j?| CM ^ is the proba- 
bility that the position measurement yields result i (i = 
1, . . . , AT), and Si is the action associated with position i. 
On the assumption that the quantum state can be put 
into one-to-one correspondence with the classical state in 
the classical limit, it follows that the quantum state of 
the particle must be of the form (pi',Xi)i where, again, 
the pi are the probabilities of the position measurement, 
and where the Xi are N real degrees of freedom. In the 
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limit, we assume Pi = p\ and Si = a\i, where a has 
the dimensions of action. 

Using this correspondence, we can transpose elemen- 
tary features of the Hamilton-Jacobi model across to the 
quantum model of the particle. Two properties will be 
of importance, to which we shall refer as global gauge 
invariance and temporal evolution. 

The first property arises from the fact that the trans- 
formation (p| CM ' 1 ; Si) — > (p| CM '';S' l + So), where S is 
an arbitrary real number, is a global gauge transforma- 
tion, which therefore leaves the predictions of the classi- 
cal model invariant. This leads to the assumption that, 
in the quantum model of the particle, the transforma- 
tion (pi\ Xi) — > (p»; Xi + Xo) is also a global gauge trans- 
formation. We then generalize by assuming that this also 
holds of the abstract quantum model, q(iV) . This global 
gauge invariance condition leads to two postulates. 

First, from the global gauge invariance condition, 
it follows that, in particular, the map, A4, represent- 
ing physical transformation of a system described using 
model q(iV), which takes the state (Pi',Xi) to (f4;Xi)> 
is such that the p\ are invariant under the transforma- 
tion (pi\Xi) -> (Pi',Xi + Xo) for any xo- This is the con- 
tent of the Gauge Invariance postulate, and can be seen 
to impose a constraint upon M.. 

Second, we require that the mea- 
sure, • • • , Pn'i Xi; • • • i Xn), induced by the metric 
over state space is consistent with the global gauge trans- 
formation, so that . . . ,pat;xi +Xo, ■ ■ • ,Xn + Xo) = 
. . . ,p N ;xi, ■ ■ ■ , Xn) for any xo, which is the con- 
tent of the Measure Invariance postulate, and imposes a 
constraint upon the functions / and /. 

The second property arises by considering the spe- 
cial case of a classical ensemble in a time-independent 
background with state (p[ CM \t); Si(t)), whose observ- 
able degrees of freedom (namely the p^ CM ^ and the AS; = 
S2+1 —Si, with 1 = 1,..., N— 1) are time- independent. In 

this case, the state evolves in time At to (p\ CM \t); Si(t) — 
EAt), where E is the total energy of the system. That 
is, in this particular case, the temporal rate of change 
of the Si encodes the energy of the system. Using the 
correspondence Si = otXii we assume that, for a system 
described within the model q(AQ in a time-independent 
background, and in a state (i) whose observable degrees 
of freedom are time-independent, and (ii) which is of def- 
inite energy E, the temporal rate of change of the Xi 
is E/a. This constitutes the Temporal Evolution postu- 
late. 

From the global gauge invariance condition and the 
Temporal Evolution postulate, we obtain a further pos- 
tulate concerning composite systems We consider a com- 
posite system consisting of two subsystems and require 
that (a) a global gauge transformation on cither sub- 
system leads only to a global gauge transformation on 
the composite system, and (b) in the case where the ob- 
servable degrees of freedom of both subsystems are time- 
independent, the subsystems are in states of definite en- 



ergy E and E' , respectively, and their environment is 
time- independent, the energy of the composite system 
is E" = E + E' . From these two ideas, we obtain the 
Composite Systems postulate. 



1. The Correspondence Argument 

Consider an experiment in which a position measure- 
ment is used to prepare a particle at time to, and a posi- 
tion measurement is subsequently performed at time t\, 
during which interval a potential V(r,t) is assumed to 
act. When such an experiment is actually performed, 
one necessarily uses position measurements with a finite 
number of possible measurement results. In this case, 
the experimental results (where, for instance, an elec- 
tron passes through a sub-micron aperture, is subject to 
electric-field interactions, and is subsequently detected 
on a screen) support the conclusion that, if these coarse 
position measurements are of sufficiently high spatial res- 
olution, the preparation is, to a very good approximation, 
complete with respect to the subsequent measurement. 

Suppose, then, that a coarse position measurement 
with N possible results is used to implement both the 
preparation and measurement steps, and further let us 
suppose that the coarse measurement is such that the 
probability that a detection is obtained in any run of the 
experiment is very close to unity. Further, let us suppose 
that the coarse measurement is of sufficient resolution 
that the preparation can be regarded as being complete 
with respect to the measurement. Then we can form a 
quantum model, which we shall denote q*(7V), within the 
framework of the abstract quantum model q(N), which 
approximately describes the experiment after time to- 

By the Results postulate and the assumption Bl' 
above, the state, S(ti), of the system immediately prior 
to the coarse position measurement determines the prob- 
ability n-tuple, p(ti) = (pi, . . . ,Pn), where ^ is the prob- 
ability of detection at the ith detector, which character- 
izes the data obtained from the coarse position measure- 
ment. 

If the above experiment is repeated, except that the 
coarse position measurement is delayed until time t<i, 
then S(ti), together with a theoretical representation 
of any interaction in the interval \t\,t2\, must (by as- 
sumption Bl') enable the prediction of the probability 
n-tuplc p{t2) that describes the coarse position measure- 
ment data obtained at time t^. To determine what ad- 
ditional degrees of freedom the state S(ti) must contain 
in order to make this prediction possible, consider the 
classical limit. 

Suppose that m is increased towards values character- 
istic of macroscopic bodies. Under the assumption made 
above, the preparation is complete with respect to the 
measurement, so that the system continues to be well- 
described by the model q(iV) even in this classical limit. 
However, as m tends towards macroscopic values, it is 
reasonable to expect that the system will increasingly 
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behave in accordance with its classical model between 
times t\ and £2- That is, in this classical limit, we expect 
that pfa), which is determined in the quantum model in 
terms of p{t\) and the other degrees of freedom in S(ti), 
will coincide with the n-tuple f^ GM \t2) that is predicted 
by a classical model of a particle of mass m moving in 
the same potential. 

The relevant classical model in this situation is a par- 
ticle ensemble model. For such an ensemble model, one 
can choose to describe an ensemble for the case of given 
total energy by means of a probability density function 
over phase space, and to describe the evolution of this 
function using Newton's equations of motion. Alterna- 
tively, one can employ the Hamilton- Jacobi model, which 
is physically equivalent. We choose the latter since it is 
more easily described on a discrete spatial lattice. 

In the Hamilton-Jacobi model, the state of the en- 
semble is given by (P(f,t),S(f,t)), which satisfies the 
Hamilton-Jacobi equations, 

§ + v.fWUo 

dt \m J (6) 

^Sf + V { r,t) = --, 

where P(r,t) is the probability density function over 
position and S(f,t) is the action function. In the 
case of coarse position measurements with N possi- 
ble results, we shall use the discretized form of the 
Hamilton-Jacobi state, (p^ M \ ■ ■ ■ ,p^ M \ Si, ... , Sn) = 

(p- CM '; Si), where p- CM ^ is the probability that the posi- 
tion measurement yields a detection at the ith measure- 
ment location, and Si is the classical action at the zth 
measurement location. 

In order that the predictions of the quantum and 
classical models agree in the classical limit, the quan- 
tum state S(t) (t > to) must contain degrees of free- 
dom which encode N quantities, which we shall de- 
note S^ M \ . . . , S^ M \ which, in the classical limit, are 
equal to the Si. Equivalcntly, we shall assume that S con- 
tains N dimensionless real quantities, Xi, . . . ,xn, such 
that gjQ M ) = axi, where a is a non-zero constant with 
the dimensions of action. 

From the above discussion, in the model q*(iV), the 
state, S, is given by {p,x), where x = (Xi> ■ • -,Xn)- A 
direct generalization of this observation leads to the as- 
sumption that the state of a system described by the ab- 
stract model q(-/V) with respect to some measurement A 
can be represented by (p, x)- As will be discussed below, 
this assumption provides the motivation for the State 
Representation postulate. 

2. Global Gauge Invariance 

In the continuum Hamilton-Jacobi model, the ob- 
servables associated with S(x,t) for a system in 
state (P(x,t), S(x,t)) are dS/dx and dS/dt. Hence, the 



transformation S(x,t) — > S(x,t) + So is a global gauge 
transformation of the model. Therefore, the discretized 
form of the model has a global gauge invariance prop- 
erty, namely that, for a system with state (p\ CM ^ ; Si), 
the transformation Si — > Si + So for i = 1, . . . , N and for 
any So is a global gauge transformation, leaving invariant 
all physical predictions made on the basis of the state. 

From this property of the Hamilton-Jacobi model, us- 
ing the above classical-quantum correspondence, we as- 
sume, in the quantum model of a particle, and, even 
more generally for the abstract quantum model q(A), 
the transformation 

(p»;x») -> ij>i\Xi + xo), (?) 

where xo <= K, is also a gauge transformation. From this 
assumption, we now shall draw two postulates. 



Postulate 3.6: Gauge Invariance 

First, we note that, as a direct result of this global 
gauge invariance assumption, it follows that a transfor- 
mation (representing passive or active physical trans- 
formation of the system) of the state (pi,Xi) to the 
state (p-; x'i) is such that the p[ are unchanged if an arbi- 
trary real constant, xo, is added to each of the Xi- This 
is the content of the Gauge Invariance postulate, which 
may be regarded as a specific example of the assumed 
global gauge invariance property. 



Postulate: 2.5: Measure Invariance 

Second, we impose the requirement that the mea- 
sure (or, in the language of Bayesian probability the- 
ory, the prior) over p\ , . . . , p N , Xi , ■ ■ ■ , Xn induced by the 
metric over state space (which metric arises from the In- 
formation Metric postulate) is compatible with the global 
gauge invariance property, and therefore satisfies the re- 
lation 

m(p»;Xi,---,Xat) =M(P»;Xi+Xo,---,Xiv + Xo), (8) 

for any xo> which is the content of the Measure Invariance 
postulate. 

The requirement of the consistency of the measure with 
the global gauge invariance property can be understood 
as follows. Suppose that one is performing Bayesian in- 
ference on the quantum system. If one's knowledge about 
the quantum system includes the fact that it has a global 
gauge invariance property, then the prior over the pi 
and Xi which one employs should reflect this fact. Oth- 
erwise, one's inference will sometimes lead to predictions 
that are not consistent with the global gauge invariance 
property. 
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Postulate 3. 7: Temporal Evolution 

Consider the special case of a system in a time- 
independent background whose observable degrees of 
freedom are time- independent. According to the 
Hamilton- Jacobi equations, the state of such a system 
evolves in time At as 

(P(x, t + At),S(x, t + At)) = (P(x, t),S(x, t) - EAt), 

where E is the energy of the ensemble. That is, the 
temporal rate of change of the unobservable degree of 
freedom encodes the total energy of the system. 

Using the above classical-quantum correspondence, we 
assume that the quantum model of a particle in a time- 
independent background which is in a state of definite 
energy with time-independent observable degrees of free- 
dom, evolves as (pi',Xi) ~ * {Pi\Xi ~ E/aAt) during the 
interval [t, t+At]. The Temporal postulate directly trans- 
poses this assumption to the quantum model q(iV). 



Postulate 4 : Composite Systems 

Let us consider a composite system, described in the 
model q(A"), consisting of two subsystems that are 
known to be in states represented by (p,; Xi) an d ip'f, x'j)i 
respectively, with i = 1, . . . , N and j = 1, . . . , N', 
and N" = NN'. The composite system is in a state 
represented by (p" , x"), where I = N'(i — 1) + j, and we 
assume that 

Pi = PiPj ( 9a ) 

Xi = giXhXj), (9b) 

where g is a function, symmetric in its arguments, to be 
determined. 

Suppose that the first subsystem undergoes the gauge 
transformation (pcXi) ~ * (PiiXi + Xo)- We require that 
this transformation leads to a gauge transformation of 
the composite system, so that 

(Pi,Xi)^(p'i',Xi+h(xo)), (10) 

where h is some function to be determined. Together 
with Eq. (|9a[) . this implies that g is linear in its first 
argument. Applying the same argument to the second 
subsystem, one obtains that g(Xi,Xj) — c Xi + d>x'j + e - 
Imposing symmetry, and setting e = without loss of 
generality, we obtain g(xi,Xj) = c (Xi + x'j)- 

To determine c, we apply the Temporal Evolution pos- 
tulate. We require that, if the energies of the subsystems 
are E and E', respectively, the energy of the composite 
system is E + E' . From the Temporal Evolution pos- 
tulate, it follows at once that c = 1. Hence, we ob- 
tain g{xux!j) — Xi + Xj- Therefore, the state of the 
composite system (p", x'{) = iPiPj] Xi + Xj-)j which is the 
content of the Composite Systems postulate. 



Alternatively, one can obtain this result more directly 
from the Hamilton- Jacobi model. We note that, if, with 
respect to position measurements along the x and y 
axes, the discretized Hamilton- Jacobi state of a particle 
is {p^ Si) and (p'j] S'j), respectively, where i — 1, . . . , N 
and j = 1, . . . , N', then, with respect to (x, y)-position 
measurements, its state is (j>"\ S") — [pip'pSi + Sj) 
where I = N(i — 1) + j. By the classical-quantum corre- 
spondence, and then generalizing to the abstract quan- 
tum model, q(-ZV), we obtain the same result. 

D. Novel Postulates 

Postulate 2.1: Complementarity 

According to the discussion of correspondence above, 
the state S(t), written with respect to some measure- 
ment A € A, can be represented by the pair (p,X)i 
where p contains the probabilities of the measurement 
results, and x 1& an ordered set of real- valued degrees of 
freedom. Hence, the state consists of a mixture of proba- 
bilities and degrees of freedom unconnected to probabili- 
ties, and measurement A yields information about the pi 
but not the Xi- The Complementarity postulate is mo- 
tivated by the aesthetic desideratum the the quantum 
state, as far as possible, should consist of probabilities of 
events rather than being such a mixture, and aims to ex- 
press the restriction on measurement A as a restriction 
on the ability of measurement A to completely resolve 
the events that occur when it is performed. 

In particular, we hypothesize that, when measure- 
ment A is performed, there are, in fact, 2N possible 
outcomes, with respective probabilities Pi, ... , P2N, and 
that result i is observed whenever either outcome 2i— 1 or 
outcome 2i is realized. We note that similar assumptions 
have been made in toy models of quantum theory in order 
to give concrete expression to complementarity [20l |. 

In Sec. IVI A| we sketch some ideas which help to pro- 
vide a better physical understanding of this postulate. 

Postulate 2.2: States 

The States postulate asserts that the state of a system 
with respect to measurement A 6 A is given by Q = 
(Qi,...,Q 2 n) T , with Q q e [-1,1], where P q = Q\. 
Hence, in addition to the P q , this postulate asserts that 
there is an additional, binary degree of freedom, cr q , asso- 
ciated with each outcome q, where <r q — sign(Q 9 ), which 
is defined whenever P q =/= 0. 

The motivation for the introduction of the a q is the 
following. If one takes the P themselves as the state 
space of the system, one finds that non-trivial one-to- 
one transformations of the state space that preserve the 
metric over the state space (as we shall require in the 
Metric Preservation postulate, described below) are not 
possible. A simple way to allow the existence of such 
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transformations is to take all Q as the state space of 
the system, for then there exist transformations, such as 
orthogonal transformations of the Q, which preserve the 
metric over the Q. 



Postulate 2.3: State Representation 

The State Representation postulate connects together 
the hypothesis, expressed by the States postulate, that 
the state of a system is given by Q = (Q\, . . . , Q2n) 
and the above assertion that the state can be represented 
by {p, X) ■ Specifically, labeling the 2N possible outcomes 
of measurement A as la, lb, ... , Na, Nb, we can redraw 
the probability tree of Fig. [1] as shown in Fig. [2j where 
the results 1,2, ... ,N of the measurement are shown in 
the upper level of the tree. 

If result i is observed, the experimenter does not know 
whether outcome ia or ib was realized, or what were 
their polarities. The probability that ia was realized 
given that result i was obtained is denoted p a u, and sim- 
ilarly the probability that ib was realized given result i 
is denoted p^u. We can encode these probabilities and 
polarities into the quantities q a u and q^u by requiring 
that p a \i = q 2 ^ and p b ^ — q 2 ^, and that the polari- 
ties <72i-i = sign(q a | i ) and a^i = sign(q & | i ), the polarities 
being defined whenever the corresponding probabilities 
are non-zero. 

The State Representation postulate now connects 
the q a u and q^u together with the Xi by asserting that 

9a\i = fiXi) 

%\i = f(Xi), 

where / is not a constant function, and where the func- 
tions / and / are function, assumed differentiable, to be 
determined. 



Postulate 2.4: Information Metric 

The Information Metric postulate asserts that the met- 
ric over the space of probability distributions P is the in- 
formation metric which, as discussed in the Introduction, 
naturally arises if one considers questions of information 
gain. 



Postulate 3.5: Metric Preservation 

The Metric Preservation postulate accords the met- 
ric over state space a fundamental place in the theo- 
retical framework: any transformation of state space is 
required to preserve the distance between any pair of 
nearby states. This idea is similar to the premise of 
Wigner's theorem, namely that transformations of the 
space of pure states of a quantum system preserve the 



Hilbert space angle between any two pure states, which 
can be interpreted as requiring that transformations pre- 
serve the distinguishability of any pair of states H . 

V. DEDUCTION OF THE QUANTUM 
FORMALISM 

The derivation will proceed as follows. First, in 
Sec. IV Al using the States and Information Metric postu- 
lates, we represent the state of a system, S(t), as a unit 
vector in a 2iV-dimensional real Euclidean space, Q 2N , 
and use two of the Transformations postulates (Mappings 
and One-to-One) to find that transformations over state 
space are orthogonal transformations. 

Second, in Sec. IV Bl the Measure Invariance postulate 
is used to determine the form of the functions / and / 
that are introduced in the State Representation postu- 
late. We then apply the remaining Transformations pos- 
tulates, which lead to the conclusion that transforma- 
tions representing physical transformations are, in fact, 
represented by a subset of the orthogonal transformations 
of the unit hypersphere, S ,2JV_1 , in Q 2N . We then show 
that these transformations can, equivalently, be repre- 
sented by the set of unitary and antiunitary transforma- 
tions of a suitably-defined iV-dimensional complex vector 
space. Finally, we show that physical transformations pa- 
rameterized by continuous parameters are represented ei- 
ther by unitary or antiunitary transformations, and that 
continuous transformations are represented by unitary 
transformations. 

Third, in Sec. IV CI we draw upon the Measurement 
Simulability postulate in order to obtain a representa- 
tion of measurements on a system, and, in Sec. IV Dl use 
the Composite Systems postulate to obtain the tensor 
product rule, which determines the state of a composite 
system in terms of the states of its subsystems. 

Finally, in Sec. IV El we generalize the formalism to al- 
low the description of measurements performed on sub- 
systems of a composite system, and to allow the descrip- 
tion of measurements with degenerate values. Addition- 
ally, in Sec. lVFI we obtain a metric over the space of pure 
states, and obtain a unitarily- and antiunitarily-invariant 
prior over b . 

A. States and Dynamics in Q-space 

1. Q-space Representation of State Space 

According to the States postulate, the state of 
a system prepared using any measurement in the 
measurement set A can be represented by Q = 
(Qi, Q2, Q2N—1, Q2n)- Now, the Information Met- 
ric postulate assigns the metric 

1 2N dP 2 

< 12 » 

q=l H 
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over P, which, from the relation P q = Q 2 , implies that 



the metric over Q is Euclidean, namely 



ds 2 = dQ 



(13) 



Hence, the state space of the system can be represented 
by the set of all unit vectors in a 2iV-dimensional real Eu- 
clidean space, which we will refer to as Q-space or Q 2N . 



2. Representation of Physical Transformations 

By the Mappings and One-to-One postulates, any 
physical transformation is represented by a one-to-one 
map, M., over state space, Q 2N . Furthermore, by the 
Metric Preservation postulate, M. must be an orthogonal 
transformation of the unit hypersphere, S 2N ~ 1 , in Q 2N . 



B. Correspondence, and Complex Form of States 
and Dynamics 

1. Determination of functions f and f 

According to the State Representation postulate, the 
state Q, of a system with respect to some measure- 
ment A € A can be written 

Q = (VPiq a \l> VPlQb\l, ■ ■ ■ , VPNla\N, VPNQb\N), (14) 

where q a \ A = f(xi) and qb\i = f(Xi)- In order to deter- 
mine the unknown functions / and /, we first determine 
the metric over Q in terms of the pi and \i ■ 
Using Eq. fm 



(15) 
(16) 

(17) 



ds 2 = dQl + dQ% -\ h dQj N _ 1 + dQ 2 2N 

p 1 (f' 2 (x l ) + f' 2 (x l ))dxl 
f ,2 (Xi) 



N 1 J 2 

1 dp 2 

i=i ri 



N 



4 pi 
»=i y% 



1 



! (x*) 



where we have used the relation f 2 (xi) + f 2 (xi) = 1 in 
the third line. Defining the function F(Xi) = / 2 (Xi); we 
can write the above as 



2 l^dp 2 



4 ^ pi 

i=l ^ l 



F' 2 (xi) 



F(xi)(l-F( Xi )) 



dxl (18) 



The measure over (pi,.. ■ ,pn', Xi> ■ ■ ■ j Xn) induced by 
this metric is proportional to the square-root of the de- 
terminant of the metric, and so is given by 



JY 



P>(j>, X) = Mo XI 



F'(Xi 



\ ^F( Xl )(l-F( Xl )) 



(19) 



where fio is a constant, which marginalizes to give 
fii{Xi) = 



(i(p, X) dVi 
F\ X i) 



(20) 



\/F( Xl )(l-F( Xl )) 



with dV% = dpi . . .dpN dxi ■ ■ ■ d X i~\d X i+\ . . . d X N, as the 
measure over X i, where c is given by 



dpi . . .dp 



N 



F\X) 



y/F( X )(l-F( X )) 



v AT-1 

dx| • (21) 



Now, using the Measure Invariance postulate, Pi(xi + 
Xo) is given by 

Kp,Xi, ■ ■ -,Xi-i,Xi +Xo,Xi+i, ■ ■ -,XN)dVj 

M(p>Xi + Xo, ■ ■ -,Xi-i +Xo,Xi +Xo, 

Xi+i + Xo,---,XN + Xo)dVi 

M(P>Xl) ■ ■ -)Xi-l>Xi,Xi+l, ■ ■ -,XN)dVi 

(22) 

with dVi ee dpi. . . dp N d Xx . . . dxi-idxi+i ■ ■ ■ d XN , where 
the variable substitution X j = Xj ~ Xo f° r j 7^ i has been 
used to obtain the second line, and Eq. (jHJ) has been 
used to obtain the third line. Hence, the measure fJti( X i) 
is independent of x%- Therefore, 



Mi(x» 



1 



dF( X ) 



2a, 



(23) 



^/F( X )(l-F( X )) d X 

where a is a constant, which has the general solution 

F( X ) = cos 2 (a X + b), (24) 

where 6 is a constant. The constant a is non-zero since, 
by the State Representation postulate, the function f( X ) 
is not a constant. Hence, the functions f( X ) and f( X ) 
have the form 



f(X) = ±cos(a X + b) 
fix) = ±sin(ax + &), 



(25) 



where the signs of / and / are undetermined. 

We note that the freedom in the choice of signs of / 
and / can be absorbed into the choice of a and b. That 
is, if one chooses the signs of / and / not to be both 
positive, then this is equivalent to choosing positive signs 
for / and / but changing the values of a and b to some 
other values, a' and b', respectively. Specifically, if one 
chooses the signs (+, — ), then a 1 = —a and b' = — b + it; 
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if (—,+), then a' = —a and b' — —6; and, if (—,—), 



then 



and b' = b + tt. Therefore, without loss of 



generality, the signs can be both taken to be positive. 

Defining 0j = a\i + b, we can write q a u = cos fa 
and (ft, | i = sin0j, and therefore, from Eq. (|14p . write 
the state of a system with respect to some measure- 
ment A £ A as 



2. Mappings 



/pN Sin 



M T - (26) 



In this section, the general form of mappings that rep- 
resent physical transformations of a system will be deter- 
mined. The derivation will proceed in three steps: 

(1) Show that the imposition of the Gauge Invariance 
postulate restricts M. to a subset of the set of or- 
thogonal transformations, and that these transfor- 
mations can be recast as unitary or antiunitary 
transformations acting on a suitably-defined com- 
plex vector space. 

(2) Show that any unitary or antiunitary transforma- 
tion represents an orthogonal transformation satis- 
fying the One-to-One, Gauge Invariance, and Met- 
ric Preservation postulates. 

(3) Show using the Continuity postulate that a physical 
transformation which depends continuously upon a 
real-valued parameter n-tuple can be represented 
by either unitary or antiunitary transformations, 
and show using the Continuous Transformations 
postulate that a continuous physical transforma- 
tion can only be represented by unitary transfor- 
mations. 

Step 1: Imposition of the Gauge Invariance pos- 
tulate. From Eq. (f2"6")l . the Gauge Invariance pos- 
tulate, and the relation fa = a\i + b given above, 
it follows that the probabilities p^^p^, ■ ■ ■ ,p' N of the 
results of measurement A performed on a system in 
state Q' = .M(Q) are unaffected if, in any state Q = 
( v /pT cos 0i , v /pT sin 0i , ... , yfp~^ cos cj> N , v /7jKF sin (f> N ) 
written down with respect to measurement A, an 
arbitrary real constant, O , is added to each of the fa. 
Our goal in this section is to determine the constraint 
imposed on M. by this condition. 

Since M. is an orthogonal transformation (Sec. IV A~2|) . 
it can be represented by the 2iV-dimensional orthogonal 
matrix, M. Under its action, the vector Q transforms as 



Q' = MQ. 



(27) 



In order to determine the most general permissible form 
of M, it is suffices to consider two types of special case. 

First consider the case where all but one of the Pi 
are zero. For concreteness, suppose that p\ = 1 



and P2, ■ ■ ■ ,Pn are all zero. In that case, from Eq. (I27p . 
using the relation p[ — Qf + Q 2 , we obtain 



{M 2 l2 +M 2 22 )} 



+ - [(M 2 , + Ml) - (Mi 2 2 + M| 2 )] cos 20i 
+ (MiiMi 2 + M 2 iM 22 ) sin 20i. 



(28) 



We require that p' x remains unchanged as a result of the 
addition of any constant 0o £ R to the <pi. However, a 
linear combination of the functions cos 20i and sin 20i in 
which at least one of the coefficients is non-zero is zero 
only on a discrete set of points. Therefore, the coefficients 
of the functions cos 20i and sin 20i must vanish, so that 
the conditions 



Ml 



M 2 21 



M 2 2 



M 2 2 



M11M12 + M21M22 = 
must hold, which implies that 

(cos^pw — en sin tpn 



Mn M12 
M 2 i M 22 



an 



\ sin Lpn (Tiicosv?!!, 



(29) 
(30) 



(31) 



where an = ±1, which is matrix composed of a en- 
largement matrix (scale factor an) and a rotation ma- 
trix if (7n = 1 or a reflection-rotation matrix (that is, 
a matrix representing a reflection followed by rotation) 
if (Tn = — 1) with rotation angle (pn in either case. 
More generally, in the case where Pi = 1, one obtains 



Pk = \ (aki+0ki) + \ {a-u 



/3fei)cos20 l 



(32) 



for k = 1, 



7fei sin 20i 
. , N, where 



M, 



/3 fel = M. 



2fc-l,2j 
2 

2fe-l,2i 



^2 2 fc,2^1 



M. 



2k,2i 



Iki — M2k-l,2i-lM 2k _ 1 ^ 2i + M 2k ^ 2i ^ 1 M 2k ^ 2i . 
The invariance condition implies the conditions 
OLki = Phi and jki = for all i, k, 



(33) 



(34) 



which implies that M takes the form of an N-by-N array 
of two-by-two sub-matrices, 



M 



/r(n) 

T (21) 



j-(12) 
T (22) 



T (1A0\ 
T (2N) 



(35) 



ji(JVl) T(N2) j,(NN) j 



where 



cos ipij —<Jij sin ipij 
sin tfij oy cos ip^ 



is a two-by-two matrix composed of a enlargement ma- 
trix (scale factor ay) and a rotation matrix if cry = 1 
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or a reflection-rotation matrix if cr* 



-1, with rotation 



angle ipij in either case. 

Before considering the second type of special case, we 
note that, due to the special form of M above, it is pos- 
sible to rewrite Eq. (f2"T)) in a simpler way. The state Q 
can be faithfully represented by 



/ Qi+iQ 2 
Q 3 + iQ 4 

W2JV-1 + iQ 



(36) 



IN/ 



so that, from Eq. (|26|) , v; = ^fple l ^ i . Define the complex 
matrix 



W 



/ aue^Jf" 11 ... a 1N e tVlN K aiN \ 
a 2 ie lV21 K a21 ... a 2N e lV2N K a2N 

\a N1 e iVN1 K° N1 ... a NN e lVNN K' TNN j 



, (37) 



where K a is the conditional complex conjugation opera- 
tion defined as 



z if a = 1 



(38) 



if a= -1, 

where z E C. Then, Eq. |27|) is equivalent to the equation 

v' = Wv, (39) 

where v' is defined analogously to v. 

Consider now the second special case, where two of 
the pi, say pi and pj (i ^ j) are set equal to 1/2, and the 
remainder are set to zero. For example, if pi = p 2 = 1/2, 
then Eq. flU yields 



Pi 



(2 1 
= ^{"ll + Q '12 

+ 2auQtl2 cos [(^11 ~ ^12) + (cil^l - CT12^2)]}. 

(40) 

In order that remains unchanged as a result of the 
addition of fa € M. to the fa, either ana±2 = or an — 

0"12- 

More generally, in the case where pi = pj = 1/2, 
with i ^ j, one obtains the expression 



Pk = l{ a li + a fcj 

- 20^0^- COS [(y>fcj - </?fcj) + (o-fei^i - CTkjfa)]}, 



(41) 



and the invariance condition implies that, for any k and 
any i ^ j, the values of aki and cr/^ must be the same 
unless aki = or otkj = 0. 

Since M represents the mapping M. , and, by the One- 
to-One postulate, Mr 1 exists, the matrix M _1 repre- 
sents the mapping Mr 1 . Hence, the matrix M _1 = M T 



must also satisfy the Invariance postulate. Now, from 
Eq. (j3"5|) , the matrix M T takes the form 



M 



/(T( n )) T (t( 21 )) t ... (t( w1 )) t \ 
(r( 12) ) T (t( 22 )) t ... (t( N2) ) t 

y(T (1N *>) T (T( 2Ar )) T ... (T( NN ')) T / 
and the corresponding complex matrix is 



(42) 



Wi, 



(43) 



Consider the transformation v' = Wv, with the above 
special case, namely Pi = pj = 1/2, where i 7^ j. In this 
case, one obtains 

Pk = \{ a lk + "ife 

- 20tikCXjk COS [( — <Tik<Pik + Vjkfjk) + (?ikfa ~ Vjkfa)]}- 

(44) 

The invariance condition implies that, for any k and 
any i ^ j, the values of and must be the same 
unless aik — or ctjk = 0. But, this implies that, in W, 
all of the non-zero entries have the same value of aik- 
Therefore, the form of W is of one of two types, 



W = V or W = VK, 
corresponding to the cases where the Oi 



the a-. 



T, respectively, where 



(45) 
1 and 

(46) 



and K is the complex conjugation operator, Kv = v*. 

Now, since M is orthogonal, it follows that V is unitary. 
To see this, consider M with all cry = 1. In that case, 



(M T M) M = [T^j 
k 

= *^2a k iakjR(-<Pki)R((pkj) (47) 

k 

= ^ a ki akjR((fikj - fki) 
k 

where Auj] denotes the (i,j)th two-by-two submatrix 
of A, with A being an N by N array of two-by-two sub- 
matrices, and where R(<p) is a two-by-two rotation matrix 
with rotation angle ip. Consider also 



(48) 



By inspection, the orthogonality condition (M J M)uji — 
5ijl, where I is the two- by- two identity matrix, is equiv- 
alent to the condition of unitarity, (V^V)^ = <Jy. There- 
fore, since M is orthogonal, V is unitary. Hence, from 
Eq. (|45|) . matrix W is either unitary or antiunitary. 
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Finally, we must show that transformations of the 
types in Eq. (j4"5)) satisfy the Gauge Invariance postulate 
for any state Q, not just for the special cases of Q con- 
sidered above. This follows immediately: we note that 
the addition of 4>q to each of the <pi in the complex form 
of the state, v, generates the state e 1 ^ v, that is 



As a result, the vector v' = Vv transforms as 



and the vector v' = VKv transforms as 



(49) 



(50) 



+ 00 



e - ** v', 



(51) 

Since p\ — |w.-| 2 , the p[ are independent of the over- 
all phase of v', so that, in both Eqs. (|50|) and (f51~j) . 
the p[ remain unchanged by the addition of <po to the <pi . 
Therefore, the transformations V and VK both satisfy the 
Gauge Invariance postulate for all Q. 

Step 2: General Unitary and Antiunitary Transforma- 
tions. We have shown so far that the imposition of the 
Gauge Invariance postulate restricts M to a subset of the 
set of orthogonal transformations, and that each trans- 
formation in this subset can be recast either as a uni- 
tary transformation or as an antiunitary transformation. 
However, we have not ruled out the possibility that there 
exist unitary or antiunitary transformations which are 
not equivalent to orthogonal transformations belonging 
to the above mentioned subset. In this section, it shall 
be shown that, in fact, any iV-dimensional unitary or an- 
tiunitary transformation satisfies the One-to-One, Metric 
Preservation, and Gauge Invariance postulates. 

Consider first the arbitrary unitary transformation U, 
where Uij = onj e lVij . The transformation 

v' = Uv 



is equivalent to the transformation 
Q' = MQ, 



(52) 



(53) 



where Mua = aijRfaij). Now, as we observed above, 

the condition of unitarity of U, namely (U^U)^ = 5ij, is 
equivalent to the orthogonality condition of M. There- 
fore, M is orthogonal. 

Similarly, in the case of the arbitrary antiunitary trans- 
formation UK, where U is defined as above and K is 
the complex conjugation operator, the corresponding ma- 
trix M is given by Muj] = ctijR((pij)F, where F 
1 
-1 



In this case, 



k 

= ^ a k iak]FR{-ip ki )R{ip k: j)F 

k 

= *^2a ki a k: jR((p k i - <p k j), 



(54) 



which is SijI due to the unitarity of U. Therefore, M is 
orthogonal in this case also. 

Since M is an orthogonal matrix, it satisfies the One- 
to-One and Metric Preservation postulates. The invari- 
ance of the p[ required by the Gauge Invariance postulate 
follows from the observation made previously that, under 
the addition of 4>o to the (/>.; in v, the transformed vec- 
tors v' = Uv or v' = UKv yield unchanged values of p^. 

Hence, any unitary or antiunitary transformation sat- 
isfies the One-to-One, Metric Preservation, and Gauge 
Invariance postulates. In particular, we have obtained 
the result of Wigner's theorem that a one-to-one map 
over state space that represents a symmetry transforma- 
tion of a system is either unitary or antiunitary. 

Step 3: Physical Transformations. By the Continuity 
postulate, a physical transformation (such as a reflection- 
rotation of a frame of reference) that depends contin- 
uously upon a real-valued parameter n-tuple ir is rep- 
resented by a map M.^ which depends continuously 
upon 7r. From the above discussion, the set of map- 
pings that represent physical transformations are the set 
of all unitary and antiunitary transformations. The set 
of all unitary transformations and the set of all antiu- 
nitary transformations are disjoint and it not possible 
to continuously transform a unitary transformation into 
any antiunitary transformation. Therefore, M.^ is rep- 
resented either by unitary transformations or by antiuni- 
tary transformations. 

Furthermore, by the Continuous Transformations pos- 
tulate, a continuous physical transformation that de- 
pends continuously upon a real-valued parameter n- 
tuple 7r is represented by a map M.^ which reduces to the 
identity map for some value of n. However, only the set of 
unitary transformations include the identity. Therefore, 
a continuous physical transformation can only be repre- 
sented by unitary transformations. In particular, since 
temporal evolution of a system is a continuous physical 
transformation, it must be represented by unitary trans- 
formations. 



C. Representation of Measurements 

In the previous section, it has been shown that the 
state of a system at time t that has been prepared by a 
measurement in A can, from the point of view of a mea- 
surement A € A, be represented by the complex vector 



i<t>i 



P2 e 



i<t>2 



p N e 



(55) 



where the pi are the probabilities of the results of mea- 
surement A if performed at time t. Furthermore, it has 
been shown that any interaction following the prepara- 
tion can be represented by a unitary transformation of v. 

Consider an experiment where a system undergoes 
some measurement A € A, yields a particular result, and 
subsequently undergoes some other measurement A' 6 A 
that may or may not be the same as A. The purpose 
of this section is to develop the formalism necessary to 
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predict the measurement probabilities in such an exper- 
iment. 



1. Prepared States 

Suppose that, in the above-mentioned experiment, a 
system undergoes measurement A and yields result j. 
What is the state of the prepared system? 

By the Results postulate, measurement A has N pos- 
sible results and, by the assumption of reproducibil- 
ity (Sec. IIII A"|) . after A has been performed and result j 
obtained, immediate repetition yields the same result 
with certainty. Therefore, for every result j there exists 
a corresponding state, Vj, such that the measurement A 
upon the system in state Vj yields result j with certainty. 
From Eq. (|56"j). since pj = 1 and all the other pj are zero, 
we have that 

v J = (0,...,e^,...,0) T , (56) 
where <f>j is undetermined. 

2. Measurements 

By the Measurement Simulability postulate, measure- 
ment A' can be simulated by an arrangement consisting 
of a measurement A followed immediately before and 
after by suitable interactions. These interactions bring 
about continuous transformations of the system. From 
the results of the previous section, these interactions 
must, therefore, be represented by unitary transforma- 
tions, which we shall denote U and V, respectively (see 
Fig. [S]). In the following, we shall establish the form of 
these matrices, and then obtain an expression for the 
probabilities for measurement A' performed on a system 
in state v. 

Result 
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— > 
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FIG. 6: Simulation of measurement A'. A unitary transfor- 
mation, U, transforms the input state, v, into Uv. Measure- 
ment A is performed on this state, and yields a result and the 
outgoing state, v, which is then transformed by the unitary 
transformation V into the output state Vv. 

First, from the Results postulate and the assumption 
of reproducibility, there exist N states \z[ , v' 2 , . . . , v' N such 
that measurement A' performed on a system in state 
yields result i with certainty. Hence, the arrangement in 
Fig. [6] must be such that A yields result i with certainty 
when the input state to the arrangement is For this 
to be the case, U must transform v^ to a state of the 



form v^e^ i , where & is arbitrary. That is, the matrix U 
must satisfy the relations 

UvJ=v <e *S i = l,2,...,N (57) 

Second, if result i is obtained from the arrangement, 
the output state of the arrangement must be of the 
form v^e^s where ^ is arbitrary. But, immediately after 
measurement A, the system is in state up to an overall 
phase. Hence, the matrix V must satisfy the relations 

V Vi =v-e^ i = l,2,...,JV (58) 

From Eq. ([So]) , the v$ form an orthonormal basis 
for C^, and, from Eq. ([FT)) , v- = Uwje^, which, since U 
is unitary, implies that the also form an orthonor- 
mal basis. Therefore, any state, v, can be expanded 
as X)i c i v i: with c[ S C, and the matrices U and V are 
determined by the relations in Eqs. ([57)) and (|58p up to 
the £i and the 

It is now possible to determine the measurement prob- 
abilities if a system in state v undergoes measurement A'. 
Using Eq. ([57)1 and the expansion v = '}2 li c' j y' i: the first 
interaction of the arrangement transforms v into 

The probability that measurement A in the arrangement 
yields result i is therefore \c' i \ 2 . Hence, measurement A' 
performed on the state v yields result i with probabil- 
ity \c' i \ 2 . If result i is obtained, the outgoing state of 
measurement A is v^, so that the output state of the 
arrangement is Vv^ which is v£ up to an overall phase. 

In summary, every measurement, A' G A, has an asso- 
ciated orthonormal basis, {v'^vj,, . . . , v^y}. Such a mea- 
surement can be simulated by a measurement A followed 
immediately before and after by interactions represented 
by U and V defined in Eqs. (|57[) and (f5"8"|) in terms of these 
basis vectors. If measurement A' is performed upon a 
system in state v, the probability, p\, of obtaining re- 
sult i and corresponding output state v- is |c-| 2 , where c- 
is determined by the relation v = J2 i c i v 'i > which is the 
Born rule. 



3. Expected Values 

If the ith result of measurement A' has an associated 
real value c^, the expected value obtained in an exper- 
iment in which a system in state v undergoes measure- 
ment A' is defined as 
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Since p\ — |c-| 2 and c- 
written as 



A 



v, this expression can be also 



= vt ^vXvfjv 



the composite system is represented by (p'{; x'{) (I — 
1, . . . , AT") in the manner defined by the Composite Sys- 
tems postulate. If we write the states of the subsystems 
in complex form, 



(61) 



v (1) = (v$Te*Sv^< 



p N e 



and 



= v T A'v, 



where the matrix A' = 



is Hermitian since 



v ( 2 ) = f x /p[e^, ^M^' 2 , 



the a'i are real, and is non-degen erate s ince the a\ have respectively, where 4> t = a X t + b and $ = ax' + 6, and, 



been assumed to be distinct (Sec. IIII A|) 

Since the are eigenvectors of A', with the a' i being 
the corresponding eigenvalues, the matrix A provides a 
compact mathematical way of representing all the rele- 
vant details about measurement A'. 



D. Composite Systems 

It is often the case that a given physical system can be 
subject to examination in distinct experimental set-ups, 
where the measurements in each set-up probe distinct 
aspects of the system. Formally, we can express this as 
follows. 

Consider a system which admits abstract quantum 
model, q(AT), with respect to measurement set A^\ and 
which admits abstract quantum model, q(Af'), with re- 
spect to measurement set A^ 2 \ where the set-ups de- 
fined by measurement sets A^ and A^ 1 are disjoint (in 
the sense defined in Sec. IIIip . The system can also be 
modeled as a whole. That is, we can construct the mea- 
surement set A — A^ x A^ 2 \ and construct abstract 
quantum model q(AT"), where N" = NN'. We shall 
accordingly speak of the system as a composite system 
consisting of two subsystems. More generally, if a system 
admits d (d > 1) abstract quantum models with respect 
to d disjoint measurement sets, we shall speak of it as a 
composite system consisting of d subsystems. 

One often prepares a state of a composite system by 
first preparing each of its subsystems, and then allowing 
these subsystems to interact with one another. In order 
to formally describe such a procedure, one needs a rule, 
the composite system rule, which we shall now derive, 
that enables the state of the system to be written down 
in terms of the states of its subsystems. 



The Composite System Rule 

In order to derive the composite system rule, we shall 
apply the Composite Systems postulate to the case of 
a composite system with two subsystems with abstract 
models q(AT) and q(AT'), respectively, where the compos- 
ite system has the abstract model q{N"). 

Suppose that the subsystems are in states represented 
by (j>i',Xi) an d (jp'j)x'j)i respectively, and the state of 



similarly, write the state of the composite system as 



fpje^, 



where </>',' — ax" + b, then it follows from the Composite 
Systems postulate that v can simply be written as <X> 

v (2). 

More generally, consider a composite sys- 
tem with d subsystems, numbered l,2,...,d, in 
states v^, v^ 2 ), . . . , vW, respectively. We can regard 
subsystems 1 and 2 as comprising a bipartite composite 
system, system 1', which, according to the above 
result, is in state vW ® v^ 2 ). Next, we can regard 
system 1' and subsystem 3 as comprising a bipartite 
composite system, system 2', which is therefore in 
state (vW (g) v^ 2 )) <X> v^ 3 ). Continuing in this way, we can 
see the state of the composite system with d subsystems 
has the state v = v (1) <g> v (2) ® • • • <g> v (d) . 



E. Some Generalizations 

1. Representation of subsystem measurements 

Suppose that measurement £ A^ x \ represented 

by A^-dimensional Hermitian operator A^ , with eigen- 
states and eigenvalues a«, respectively, is performed 
on subsystem 1 of dimension N of a bipartite composite 
system of dimension N" = NN', where AT' is the dimen- 
sion of subsystem 2. With respect to the abstract quan- 
tum model q(A r ") of the composite system, measure- 
ment A*- 1 ) is not in the measurement set A of the com- 
posite system since the measurement has only N distinct 
results whereas a measurement in A has N" = NN' > N 
possible results. However, it is convenient to be able to 
describe measurement A^, which we shall describe as 
a subsystem measurement, as an W'-dimensional opera- 
tor A, in the framework of q(N"). 

To determine the form of A, it is sufficient to consider 
the effect of A on product states of the form ® v' 2 ' 

cjivf 5 . If the 



of the composite system, where A^v^ 
composite system is in such a state, then subsystem 1 is 
in state vj . Therefore, when measurement A' 1 ' is per- 
formed, result a; is obtained with certainty, and the state 
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of subsystem 1 is unchanged (up to an irrelevant overall 
phase). Therefore, the state of the composite system 
remains unchanged. If we require that A has eigenvec- 
tors vj 1 ^ <g> v( 2 ', with respective eigenvalues <Zj, it follows 
that A can be taken to be A (1) <g> l (2) , where l (2) is the 
identity matrix in the model of subsystem 2, with the 
only freedom being a physically irrelevant overall phase 
in each of the eigenstates of A^\ 

The above result trivially generalizes to the case of a 
measurement performed on one subsystem of a composite 
system consisting of d subsystems. 



2. Degenerate measurements 

The model q(AT), whose explicit mathematical form 
has been derived above, applies to an abstract set-up 
where the measurements, chosen from the set A, have N 
possible results and therefore, by the distinctness as- 
sumption of Sec. IIII A[ necessarily have N distinct val- 
ues. From the above discussions, it follows that each 
measurement A £ A is represented by a non-degenerate 
Hermitian operator of dimension N. 

Now, it is useful to be able to describe measurements, 
within the context of model q(iV), which have fewer 
than N results. An example of such measurements that 
we have discussed above are subsystem measurements. 
We shall now broaden the discussion to allow for mea- 
surements with N' < N possible results where N' is not 
a multiple of N and which therefore cannot be regarded 
as subsystem measurements. 

Consider an abstract set-up where a preparation imple- 
mented using a measurement from A is followed by mea- 
surement A, whose probabilities are denoted pi, . . . ,pn- 
Suppose that, if measurement B (with N' < N pos- 
sible results) replaces measurement A, the probabili- 
ties, p'xi ■ ■ ■ ,p'n' °f measurement B can be determined 
from the pi by a many-to-one map of the results of A to 
the results of B. For example, in the case where A" = 3 
and N' = 2, the map from the results of A to the results 
of B might consist in 1 — > 1', 2 — > 2' and 3 — > 2', in 
which case p[ — p\ and p' 2 = P2 + P3- In such a case, 
we shall say that measurement B is a degenerate form of 
measurement A; or, more simply, that measurement B 
is a degenerate measurement. 

Now, measurement B can formally be treated as if it 
has N possible results, but where some of these results 
have the same value. In this mode of description, in the 
above example, one can maintain a one-to-one map be- 
tween the results of A and of B (so that 1 — ► 1', 2 — ► 2' 
and so on), but label the results of B with their val- 
ues, and, when computing the probabilities of B, group 
together the results which have the same value. In the 
above example, one would label the three results with 
values b%, b% and 63, respectively, and but have bi = b 3 . 

Since measurement B is a degenerate form of measure- 
ment A, it can be represented by the A-dimensional de- 



generate Hermitian operator B = Xi ^i v i v I j where A v.; = 
a^i. The probabilities for measurement B can then be 
computed in the usual way, on the understanding that 
those results with the same values must not be regarded 
as physically distinguishable, but must be grouped as just 
described. 

Conversely, in an abstract set-up where A contains 
measurements represented by all possible non-degenerate 
Hermitian operators, a degenerate Hermitian operator 
can be regarded as representing a measurement which is 
a degenerate form of some measurement in A. 



F. A Transformation-Invariant Metric and 
Measure over State Space 

When the state Q in Eq. (f2"6| is written in complex 
form v = (Qi + iQ 2 , . . . , Q2N-1 + «Q2Jv) T , the met- 
ric ds 2 = Y. q dQ 2 q over Q 2N becomes ds 2 = |dv| 2 in C N . 
By construction, this metric is invariant under arbitrary 
unitary or antiunitary transformations. 

Furthermore, the metric over Q 2N induces a mea- 
sure that is uniform over S 21 * -1 , and that inherits the 
transformation-invariance of the metric. 



VI. DISCUSSION 

A. Discussion concerning individual outcomes and 
their polarities 

The Complementarity and States postulates assert 
that, when measurement A is performed, there are 2N 
possible outcomes, each with an associated polarity (a 
binary degree of freedom), but that, when outcomes ia 
or ib occur (with their respective polarities), only the re- 
sult i is registered. The above derivation of the quantum 
formalism lends support to the plausibility of these as- 
sertions, but raise the immediate question as to why the 
measurement does not (or cannot) resolve the individual 
outcomes and their associated polarities, and how one 
can more intuitively understand their physical meaning. 
A tentative answer to these questions is as follows. 



1. Unobservability of the individual outcomes and their 
polarities. 

First, as we shall derive in Paper II, the overall 
phase, <fr, of a system in an eigenstate of energy, E, 
changes at the rate E/h. Consequently, the probabili- 
ties Pn-\ and P 2 i of outcomes 2i— 1 and 2i are oscillating 
at frequency 2E/h, and the polarities of these outcomes 
are switching at frequency E/h. Now, it is reasonable to 
suppose that, if one wishes to observe the realization of 
outcomes ia or ib and their polarities, the measurement 
performed must have a temporal resolution At <C h/2E. 
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Conversely, if the measurement does not have such reso- 
lution, it seems reasonable to suppose that one outcome 
and its polarity will not be cleanly realized, but rather 
both outcomes and their polarities will be realized over 
the duration of the measurement, leading to the situa- 
tion where only the property that both outcomes hold in 
common, namely the result i, can be observed, with the 
other properties (namely a or b, and the polarity + or — ) 
being 'smeared out'. 

Now, according to the energy-time uncertainty rela- 
tion AEAt > h/2 [42], the energy associated with the 
interaction used to implement a measurement with the 
temporal resolution needed to observe the individual out- 
comes and polarities has uncertainty AE > hh/At, so 
that AE > E/2n. From E = mc 2 , it then follows 
that AE must be of the order of the rest energy of the 
system. A measurement of such energy would therefore 
probably not preserve the integrity of the system, thereby 
violating the assumption made in Sec. IIIIAl that interac- 
tions preserve the integrity of the system. Hence, a mea- 
surement with the requisite temporal resolution cannot 
be consistently described within the quantum formalism. 
Conversely, a measurement which, with high probability, 
preserves the integrity of the system, will have insufficient 
temporal resolution to resolve the individual outcomes 
and their polarities. 



2. Physical meaning of the outcomes and polarities 

The outcomes and polarities can be visualized as fol- 
lows. Since the values of the ^-dependent state degrees 
of freedom, q a u and q b ^, are unaffected by the addition of 
a multiple of 27r to fa , the physically relevant information 
about fa consists of its value mod 2tt. Furthermore, the 
measure over fa is uniform. Hence, fa can be faithfully 
represented as shown in Fig. [7] In the figure, the value 
of fa in the range [0, 2tt] is shown, and the measure over 
this interval is uniform, as is implicit in the Euclidean 
representation. When measurement A is performed, the 
possibilities a and b for given i register which axis the vec- 
tor is found to be pointing along, and the polarities, + 
and — , determine whether the vector is pointing along 
the positive or negative direction along the respective 
axis. 

The probability that a is obtained is the same as the 
probability that a photon plane-polarized at angle fa 
to the horizontal passes a horizontally-inclined filter, 
namely cos 2 fa . This visual representation suggests that 
the unobserved features of the outcomes ia and ib may 
be connected to Einstein's idea massive particles can be 
regarded as energy that is 'trapped' in a region of space 
and is undergoing rapid, to-and-fro motion of some kind, 
which accounts for their inertia (43j |. to Hestenes' con- 
tention that an electron consists in a localized circular 
motion which accounts for electron spin [2l|, [22j , and to 
the Zitterbewegung of an electron that is seen in the so- 
lution of the Dirac equation. The exploration of these 
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FIG. 7: A vector in afc-space inclined at angle <j>i to the a- 
axis. When measurement A is performed, outcome a or b is 
obtained with probability cos 2 4>i and sin 2 (f>i, respectively. 



tentative suggestions remains for future work. 



B. Some Implications of the Deduction 

1. The Role of Information in Quantum Theory 

One of the major objectives of the program of deriv- 
ing quantum theory using the concept of information is 
to determine whether the concept of information is indis- 
pensable to our understanding of the quantum formalism, 
and, if so, to illuminate the precise relationship between 
the concept of information and the quantum formalism. 

On the first issue, although many recent approaches 
to derive the quantum formalism involve the concept of 
information, the conclusion that information is indispens- 
able to our understanding of the quantum formalism can- 
not be drawn, either because the approaches are unable 
to obtain the quantum formalism (even though they are 
able to derive specific results, such as Malus' law), or be- 
cause, in those approaches that are able to obtain a sig- 
nificant fraction of the quantum formalism, the abstract 
nature of some of the assumptions that are employed ob- 
scures the role played by information in determining the 
formalism. Indeed, further doubt on the need for infor- 
mation is cast by other recent approaches, most notably 
due to Hardy [23, [24[ , that are successful in deriving a 
significant fraction of the quantum formalism without ex- 
plicitly invoking the concept of information. 

The formulation presented here provides significant 
new insight into this issue. The formulation rests on as- 
sumptions that are transparent and that are, to a large 
extent, traceable to familiar or well-established experi- 
mental facts or theoretical idea, so that, for example, ab- 
stract assumptions that directly introduce complex num- 
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bers are avoided. Consequently, the role played by infor- 
mation in the derivation can be clearly seen. Further- 
more, one can see that its role is sufficiently widespread 
that it seems very likely that the concept of information 
could indeed have a fundamental role to play in our un- 
derstanding of the origin of the quantum formalism. 

On the second issue, owing to the transparency of the 
assumptions that form the basis of the derivation, it is 
apparent from the derivation that the concept of informa- 
tion, via the information metric, plays a substantial role 
in giving rise to various structural features of the quan- 
tum formalism. First, we have seen that, if the space of 
probability distributions is endowed with the information 
metric, a transformation to the space where the distribu- 
tions are parameterized by the square-roots of probability 
is endowed with a Euclidean metric. Hence, even at the 
level of classical probability theory, it appears, just as one 
sees in the quantum formalism, that the square-roots of 
probability have a rather fundamental significance 

Second, once a quantum state is represented as a unit 
vector, Q = (y/plf(xi), y/Pif(Xi), \/Pjv/(xaO) T , in a 
2iV-dimension real Euclidean space, the imposition that 
the measure over Xi is (which follows from the re- 
quirement that the measure induced by the metric over 
the Q is consistent with the global gauge condition) 
leads immediately to the functions f(xi) — cos(axi + b) 
and f(xi) = sin(axi + b). Hence, the sinusoidal func- 
tions into which the phases, fa — axi + b, in a quantum 
state, v = (y/pTe^ , . . . , ^e^* ) T , enter can be directly 
traced to the concept of information. 

Third, the requirement that transformations of the 
state space preserve the metric leads to the conclusion 
that the transformations must be orthogonal. As we have 
shown, it is then only necessary to impose the Gauge 
Invariance postulate to restrict the set of allowed trans- 
formations to a subset of the orthogonal transformations 
in order to obtain a one-to-one correspondence between 
the allowed transformations and the set of unitary and 
antiunitary transformations of a complex vector space. 



2. Insights into the Quantum Formalism 

The derivation provides a number of significant in- 
sights into the quantum formalism, of which we mention 
only a few. First, we note that, since the development of 
the quantum formalism, there has been some uncertainty 
as to whether the formalism is the most general formal- 
ism for the description of quantum phenomena in flat 
space-time. Various possibilities have been suggested for 
the generalization of the formalism which, from a purely 
mathematical point of view, seem to be plausible, and 
which may have interesting physical consequences. For 
example, the possibility of non-unitary temporal evolu- 
tion has been considered by several authors [H, [26|, |27| • 
The derivation given above gives rise to a mathemati- 
cal structure that is neither more nor less general than 
the finite-dimensional abstract quantum formalism, and 



thereby lends support to the view that the quantum for- 
malism is the most general formalism for the description 
of quantum phenomena in flat space-time. 

Constructively, the formulation has the potential to 
allow one to explicitly work out the effect that specific 
changes to particular postulates would have upon the 
quantum formalism. For example, if one wishes to mod- 
ify the quantum formalism so as to allow continuous 
transformations to be represented by non-unitary trans- 
formations, one could identify which postulates could be 
plausibly modified (in the present example, the Gauge 
Invariance Postulate would seem to be the obvious can- 
didate) , and explicitly follow through the changes in the 
formalism that result. 

The second insight is based on the fact that antiunitary 
transformations are not generally regarded as an integral 
part of the abstract quantum formalism (as formalized, 
for instance, by Dirac or von Neumann), but are instead 
usually introduced by reference to the theorem of Wigner 
mentioned in the Introduction. Since this theorem rests 
on particular assumptions, this raises the question as to 
whether antiunitary transformations are as fundamental 
as unitary transformations. However, since unitary and 
antiunitary transformations emerge on an equal footing 
in the derivation presented above, the derivation suggests 
that antiunitary transformations are an equally funda- 
mental part of the quantum formalism. 

Third, the prevalence of complex numbers in the quan- 
tum formalism is perhaps its most mysterious mathe- 
matical features. The emergence of complex numbers in 
the derivation depends on most of the postulates, and so 
is not easy to unravel, but the role of the global gauge 
invariance condition is perhaps the most obvious: prior 
to the imposition of the two postulates based on this 
condition (namely, the Gauge Invariance and the Mea- 
sure Invariance postulates), the state space is S 21 ^^ 1 in 
a 2iV-dimensional real space, and the set of all possible 
transformations is the set of orthogonal transformations. 
However, these two postulate restrict the set of all pos- 
sible transformations of Q 2N to a subset of the orthog- 
onal transformations, and thereby allows the set of all 
possible transformations to be represented by the set of 
all unitary and antiunitary transformations of a suitably 
defined complex vector space. Hence, from this perspec- 
tive, it appears that the use of complex numbers in the 
quantum formalism is directly tied to the set of possible 
transformations of state space, and is critically depen- 
dent upon the global gauge invariance condition. 

Fourth, we have seen that the quantum formalism can 
be represented in a real form, where a pure state is rep- 
resented by a unit vector on the unit hypersphere in 
a real 2iV-dimensional space, and where allowed trans- 
formations are a subset of the set of orthogonal trans- 
formations of the unit sphere. Such a simple, easy vi- 
sualizable representation, which does not appear widely 
known, may prove useful in certain contexts. For exam- 
ple, in this representation, one can immediately see that 
a uniform measure over the unit hypersphere, S 2N_1 , 
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is invariant under all orthogonal transformations, and is 
therefore unitarily- invariant. 

Fifth, it has been suggested [|[ that the quantum for- 
malism may owe at least a significant part of its structure 
to the fact that quantum theory permits non-locality and 
no-signaling to peacefully coexist, and a number of recent 
reconstructive approaches [H, [H, H3 rely upon postu- 
lates that concern the behavior of physically separated 
sub-systems to arrive at the key mathematical features 
of the quantum formalism. However, the above deriva- 
tion suggests that such considerations are not essential to 
the development of a physical understanding of the core 
of quantum formalism. 

VII. CONCLUSION 

In this paper, we have shown that the finite- 
dimensional abstract quantum formalism (apart from the 
explicit form of the temporal evolution operator) can be 
derived within the framework of information geometry 
using features of quantum phenomena which can be phys- 
ically understood by reference to experimental observa- 
tions and general theoretical principles. The derivation 



illuminates the physical origin of the quantum formal- 
ism, suggests that information plays a key role in giving 
rise to the quantum formalism, and potentially has sig- 
nificant implications for the interpretation and proposed 
modifications of quantum theory. 
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